LIGAND DIRECTED ENZYME PRODRUG THERAPY

Introduction and background to the invention

The present invention relates to ligand directed enzyme 
prodrug therapy (LIDEPT) and its use in the treatment of 
disease, including tumours. 

A therapeutic approach termed "antibody-directed enzyme 
prodrug therapy" (ADEPT) has been proposed as a method for 
treating tumour cells in patients using prodrugs. Tumour 
cells are targeted with an antibody conjugated to an enzyme 
capable of activating a prodrug. The antibody in the 
conjugate binds to tumour cells in order that the enzyme can 
convert the prodrug to an active drug outside the tumour 
cells (see e.g. WO88/07378). Alternatively, methods for the
delivery of genes which encode an enzyme have been used (see
e.g. EP-A-415731). Such methods include calcium phosphate
co-precipitation, microinjection, liposomes, direct DNA
uptake, and receptor-mediated DNA transfer. These are
reviewed in Morgan & French Anderson, Annu. Rev. Biochem.,
1993, 62: 191. The term "GDEPT" (gene-directed enzyme
prodrug therapy) is used to include both viral and non-viral 
delivery systems. 

The present invention relates to a new class of enzyme
delivery systems, involving the use of ligands which 
recognise receptors on the surface of tumour cells. Tumour 
cells are targeted with a ligand-enzyme conjugate or a 
ligand-enzyme fusion. 

Disclosure of the invention

The present invention provides a two component system for use 
in association with one another comprising: 

(a) a fusion protein or conjugate of a ligand with an
enzyme; and

(b) a prodrug which can be converted into an active
drug by said enzyme.

The ligand component of the above system is preferably a
ligand which is:

(i) a naturally occurring polypeptide whose
biological role is to bind to a cognate receptor
on the surface of the cell; or
(ii) a fragment of said polypeptide which still binds
to its cognate receptor; or
(iii) derivatives of (i) and (ii) above with altered
receptor specificity.

The invention also provides the system of the invention for 
use in a method of treatment of a patient, and a method of 
treating a tumour in a patient in need of treatment which
comprises administering to said patient an effective ligand- 
enzyme capable of binding to the receptor and a prodrug 
capable of being converted by said enzyme to an active drug. 
The invention further provides novel conjugates of a ligand 
(or fragments or derivatives thereof) with an enzyme. 

Brief description of the drawings 

Figure 1 shows schematically a ligand, an enzyme, and 
some fusion proteins containing them. 

Detailed description of the invention

A. Ligands

Ligands which are naturally occurring polypeptides whose 
biological role is to bind to a cognate receptor are 
preferably naturally occurring mammalian polypeptides. 
Desirably, such polypeptides will be selected from the 
species of mammal which is to be treated by the invention. 
Thus, when the system of the invention is intended for human 
use, the polypeptide will preferably be of human origin. In 
veterinary applications, the polypeptide may be, for example, 
bovine, ovine, porcine, canine or feline. In research 
applications, the polypeptide may be of these or other 
origin, e.g. primate, rodent (such as mouse, rat or rabbit).

Examples of ligands include epidermal growth factor (EGF),
which binds to the epidermal growth factor receptor (EGFR),
heregulin and c-erbB2 ligand. EGFR and c-ErbB2 are expressed
in a number of tumour types.

In general, examples of suitable ligands include those that
recognise receptors that are expressed exclusively, or in
greater number, on tumour cells or on vasculature that is in
the vicinity of the tumour.

An example of the latter is the vascular endothelial
growth factor (VEGF) receptor. VEGF is a regulator of tumour
angiogenesis which is produced by malignant cells and acts on
tumour endothelial cells which express high affinity VEGF
receptors. The known VEGF receptors are tyrosine kinases
called flt-1 and flk-1, which are specifically expressed in
endothelial cells of the tumour and in the border between the
tumour and normal tissue (Science 251: 989, 1992; Cell 72:
835, 1993). Flk-1 is specifically expressed in endothelial
cells during embryonic development but is down-regulated in
the adult, when angiogenesis stops. Tumours are dependent on
angiogenesis since they require a continuous supply of
nutrients and oxygen from the tumour vasculature in order to
grow further. Therefore the ligand VEGF, which binds to flt-1
and/or flk-1, may be used to selectively target an enzyme to
the tumour environment. The advantage in targeting the
tumour endothelium is that it overcomes the problem of poor
penetration of large molecules (e.g. antibodies) due to lack
of accessibility. Also, the clearance time in vivo for a
ligand is much shorter, so that less time is required between
administration of the ligand-enzyme and injection of the
prodrug, to allow clearance from normal tissues and blood,
than is required for antibody-enzyme conjugates.

The VEGF may be conjugated to, or made as a fusion protein 
with, a non-endogenous enzyme that can selectively catalyse
the conversion of a prodrug to a toxic drug in LIDEPT. In
this LIDEPT system, flt-1 and/or flk-1 would effectively be
targeted with a non-endogenous enzyme that converts prodrugs
to toxic drugs. Enzymes and prodrugs which may be used
include those disclosed in WO88/07378, WO89/10140,
WO90/02729, EP-A-415 731, WO91/03460, WO93/08288, WO94/25429,
WO95/02420, WO95/03830, PCT/GB95/01783 and PCT/GB95/01782, the
disclosures of which are incorporated herein by reference.

Using this anti-angiogenic approach will serve a triple 
purpose since the toxic drugs are small molecules that are 

able to diffuse through the tumour vasculature and the
tumour. Firstly, it will kill the tumour vasculature
endothelial cells. Secondly, it will kill the tumour cells
by exerting a bystander effect. Thirdly, it will also kill
the cells of the tumour by starving them of nutrients and
oxygen.

Fragments of a ligand may also be used, provided they still
bind to the cognate receptor to which the ligand binds.
Desirably, the binding will be of substantially similar
affinity, e.g. from about 10-fold lower to 10-fold higher, or
higher affinity, e.g. up to about 20 to 100-fold higher.

The binding will also be of substantially similar selectivity
for the cognate receptor, so that the fragment does not cross
react or bind to other cell surface receptors to a
substantially greater degree than the ligand itself.

The binding affinity of fragments may be determined by
standard techniques, e.g. I-125 ligand competition assays. The
binding specificity may be determined by e.g. I-125 ligand-
enzyme biodistribution in vitro.
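
The affinity ranges given above for ligand fragments can be
illustrated with a minimal sketch. It assumes, purely for
illustration, that affinity is summarised as a dissociation
constant (Kd) for the ligand and for the fragment; the function
names and the example values are hypothetical and are not part
of the disclosure.

```python
def affinity_fold_change(kd_ligand_nm: float, kd_fragment_nm: float) -> float:
    """Fold change in affinity of the fragment relative to the ligand.

    Assumption: affinity is expressed as a dissociation constant (Kd),
    so a lower Kd means higher affinity. A result above 1 means the
    fragment binds more tightly than the ligand.
    """
    if kd_ligand_nm <= 0 or kd_fragment_nm <= 0:
        raise ValueError("Kd values must be positive")
    return kd_ligand_nm / kd_fragment_nm


def classify_affinity(fold_change: float) -> str:
    """Place a fold change into the ranges named in the text."""
    if fold_change < 0.1:
        return "lower"                  # more than 10-fold lower
    if fold_change <= 10:
        return "substantially similar"  # 10-fold lower to 10-fold higher
    if fold_change <= 100:
        return "higher"                 # up to about 100-fold higher
    return "beyond stated range"


# Hypothetical values in nM, for illustration only.
similar = classify_affinity(affinity_fold_change(1.0, 5.0))
tighter = classify_affinity(affinity_fold_change(1.0, 0.02))
weaker = classify_affinity(affinity_fold_change(1.0, 20.0))
```

Here a fragment with a five-fold weaker Kd still falls within the
"substantially similar" range, whereas a twenty-fold weaker Kd
does not.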

Fragments of ligands may be generated in any desired way.
For example, ligands may be cleaved enzymatically, e.g. using
trypsin or other proteolytic enzymes, or chemically using
protein fragmentation compounds such as cyanogen bromide.
Fragments of the ligands may be separated by chromatography
and recovered. Conditions for cleavage, chromatography and
recovery of proteins are well known in the art. More usually,
fragments will be generated by recombinant DNA techniques. DNA
encoding the ligands can be spliced at restriction sites to
delete regions of the DNA encoding the ligand. Where the
fragment of the ligand is a C-terminal truncated fragment,
the DNA may be altered by site directed mutagenesis to
introduce a stop codon at the desired point of truncation.
Other methods of manipulating DNA may be found in standard
reference books, e.g. Sambrook et al, Molecular Cloning, CSH,
1987.
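
The introduction of a stop codon at a desired point of
truncation can be sketched in silico as follows. This is an
illustrative sketch only: the coding sequence is a made-up toy
example rather than any ligand of the invention, the function
names are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, built compactly in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def introduce_stop(dna: str, keep_residues: int, stop: str = "TAA") -> str:
    """Replace the codon after the last retained residue with a stop codon."""
    pos = 3 * keep_residues
    if keep_residues < 1 or pos + 3 > len(dna):
        raise ValueError("truncation point lies outside the coding sequence")
    return dna[:pos] + stop + dna[pos + 3:]


# Toy coding sequence (hypothetical): Met-Ala-Lys-Leu-Glu-stop.
toy_cds = "ATGGCTAAACTGGAATGA"
full_length = translate(toy_cds)
truncated = translate(introduce_stop(toy_cds, keep_residues=3))
```

With this toy sequence the full-length product is MAKLE and the
C-terminally truncated product is MAK.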

The truncated DNA may then be expressed in a recombinant 
expression system to produce the ligand fragment, and the 
fragment recovered. As described herein, the DNA encoding 
the fragment may also be linked to DNA encoding the enzyme of
the LIDEPT system so as to provide a fusion protein.

Derivatives of ligands and their fragments may also be made
by recombinant DNA techniques. Site directed mutagenesis may
be used to introduce amino acid substitutions, deletions or
insertions into the ligand. One class of substitution which
may be introduced is conservative substitutions.

Conservative substitutions may be made according to the
following table, where amino acids in the same block in the
second column and preferably in the same line in the third
column may be substituted for each other:

As with ligand fragments, it is preferred that the
derivatives bind to their cognate receptor with substantially
similar or higher affinity and selectivity as the ligand
itself, wherein the preferred degree of selectivity and
affinity is as defined above for ligand fragments.

B. Enzymes 

The enzyme may be any enzyme which is not normally expressed 
on the surface of a cell, nor released into the circulation, 
particularly a mammalian (especially human) cell, and which
is capable of converting a prodrug into an active drug. The
enzyme may be a mammalian enzyme which does not naturally
occur in a human or a human enzyme which is not normally
accessible to the prodrug. This includes enzymes from other
species as well as mammalian enzymes which are altered in a
manner which is selective for the prodrug. In other words,
the alteration means that the conversion of the prodrug to an
active drug by the natural enzyme will be at a rate one or
more orders of magnitude less than the rate at which the
altered enzyme operates. Altered enzymes may be made by
standard recombinant DNA techniques, e.g. by cloning the
enzyme, determining its gene sequence and altering the gene
sequence by methods such as site-directed mutagenesis.

The enzyme will usually convert the prodrug into an active
drug by removing a protecting group from the prodrug. In
most cases, the protecting group will be cleaved as a whole
from the prodrug. However, it is also possible for the
enzyme to cleave or simply alter part of the protecting
group, resulting in a partially cleaved or altered protecting
group which is unstable, resulting in spontaneous removal of
the remainder of the group. Such prodrugs are of particular
use in association with the nitroreductase enzyme described 
below. 

Preferably, the enzyme is a non-mammalian enzyme. Suitable 
non-mammalian enzymes include bacterial enzymes. Bacterial
enzymes include carboxypeptidases, such as carboxypeptidase
G2 (CPG2), as disclosed in WO88/07378, and Pseudomonas
γ-glutamylhydrolase EC3.4.22.12 (Levy CC & Goldstein P, J.
Biol. Chem. 242: p2933 (1967)), and nitroreductases, such as
an E. coli nitroreductase as disclosed in WO93/08288.
Examples of other suitable enzymes include thymidine kinase
(tk), especially viral tk such as VZV or HSV tk; and
β-lactamase and β-glucuronidase. Other enzymes include
penicillin V amidase, penicillin G amidase and cytosine
deaminase.

Fusion proteins will be expressed in both eukaryotic (e.g. 
insect, mammalian) cells and non-eukaryotic (bacterial) cells 
as required for each ligand. Many ligands are processed 
through the Golgi apparatus and endoplasmic reticulum, where
they become glycosylated; this may be important for binding
activity. In these cases, eukaryotic based expression
systems will be employed. In the case of fusion proteins,
especially for fusions between non-eukaryotic enzymes and
ligands, where the ligand needs to pass through the Golgi,
the enzyme may also pass through the Golgi/ER and may
consequently also become glycosylated; this may lead to a
reduction in activity of the enzyme compared to its non-
glycosylated form.

In a preferred aspect of the invention the enzyme has been
altered by substitution, deletion or insertion at one or more
(e.g. two, three or four) glycosylation sites. For example,
within the primary amino acid sequence of CPG2, there are
three such consensus motifs, located at residues Asn 222, Asn
264 and Asn 272. Alteration of one or more of these sites is
preferred when CPG2 is used in the present invention.
Desirably, the alteration is substitution to leucine or
glutamine.
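
The location and alteration of glycosylation sites can be
sketched as follows. The sketch assumes that the "consensus
motifs" referred to are the usual N-linked glycosylation sequon
Asn-X-Ser/Thr (where X is any residue except proline); the text
itself does not spell out the motif. The sequence used is a
short invented example, not the CPG2 sequence, and the function
names are hypothetical.

```python
import re

# Assumed motif: N-X-S/T with X != P. The lookahead lets
# overlapping sequons be reported individually.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein: str) -> list[int]:
    """Return 1-based positions of Asn residues within N-X-S/T sequons."""
    return [match.start() + 1 for match in SEQUON.finditer(protein)]


def substitute_asn(protein: str, positions: list[int], replacement: str = "Q") -> str:
    """Substitute the Asn at each 1-based position (default: glutamine)."""
    residues = list(protein)
    for pos in positions:
        if residues[pos - 1] != "N":
            raise ValueError(f"residue {pos} is not asparagine")
        residues[pos - 1] = replacement
    return "".join(residues)


# Invented toy sequence with sequons at Asn 3 and Asn 13;
# Asn 8 is followed by proline and so is not a sequon.
toy_protein = "MKNGSAANPTLLNATQ"
sites = find_sequons(toy_protein)
altered = substitute_asn(toy_protein, sites, replacement="Q")
remaining = find_sequons(altered)
```

Passing replacement="L" instead gives the leucine substitution
also mentioned above.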

In general, alterations to enzymes which are amino acid
substitutions of from at least 1 to all glycosylation sites
are particularly preferred, although deletions or insertions
of for example 1, 2, 3, 4, 5 or more amino acids are also
possible. In any event, the alteration will be such that the
enzyme retains its ability to convert a prodrug to an active
drug at substantially the same rate as the unchanged,
unglycosylated enzyme. In this context, "substantially
unchanged" will desirably be within 1 order of magnitude, and
preferably from about 2-fold less to 2, 5 or 10 fold more.

In addition to specific changes the enzyme may otherwise be 
altered by truncation, substitution, deletion or insertion as
long as the activity of the enzyme is substantially unchanged
as defined above. For example, small truncations in the N-
and/or C-terminal sequence may occur as a result of the
manipulations required to produce a vector in which a nucleic
acid sequence encoding the enzyme is linked to the various
other signal sequences described herein. The activity of the 
altered enzyme may be measured in suitable model systems 
which can be prepared in routine ways known in the art. 

In a further aspect of the present invention, there is
provided a vector comprising a bacterial carboxypeptidase
gene which has been altered by substitution, deletion or
insertion at one or more glycosylation sites, fused in-frame
to a gene encoding a ligand or fragment or derivative
thereof. The ligand gene is fused at either the 5' or 3' end
of the carboxypeptidase gene, and the two may be fused
directly or spaced by a linker sequence encoding one or more
(e.g. up to 10, 20 or 100) amino acids. The fusion will be
operably linked to a promoter capable of expressing the
fusion construct in a host cell. The carboxypeptidase is
desirably CPG2 with the amino acid sequence of SEQ ID No. 2
except for said one or more substitutions, deletions or
insertions. Variants of such carboxypeptidases containing
further substitutions, deletions or insertions but which
retain substantially unchanged carboxypeptidase activity are
a further part of this aspect of the invention. Such
variants may for example include truncated enzymes as
discussed above.
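
The in-frame fusion of two genes, directly or via a linker, can
be sketched as follows. The sequences are invented toy examples
rather than the sequences of SEQ ID Nos. 2, 4 or 6, the function
names are hypothetical, and the sketch only checks reading frame
and premature stop codons; it says nothing about expression or
activity.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(dna: str) -> list[str]:
    """Split a sequence into codons, rejecting out-of-frame lengths."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def strip_stop(orf: str) -> str:
    """Remove a terminal stop codon so translation reads through."""
    triplets = codons(orf)
    if triplets and triplets[-1] in STOP_CODONS:
        triplets = triplets[:-1]
    return "".join(triplets)


def fuse_in_frame(upstream: str, downstream: str, linker: str = "") -> str:
    """Join upstream ORF, optional linker and downstream ORF in one frame."""
    fusion = strip_stop(upstream) + linker + downstream
    triplets = codons(fusion)  # raises if the linker breaks the frame
    if any(codon in STOP_CODONS for codon in triplets[:-1]):
        raise ValueError("fusion contains a premature stop codon")
    return fusion


# Toy example: upstream ORF (Met-Ala-stop), Gly-Gly-Ser linker,
# downstream ORF (Met-Lys-stop). All sequences are hypothetical.
fusion = fuse_in_frame("ATGGCTTAA", "ATGAAATGA", linker="GGTGGTTCT")
```

Either gene may be placed upstream, corresponding to fusion of
the ligand gene at the 5' or 3' end of the enzyme gene.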

Alterations may also be introduced into the sequences 
encoding the ligand. Truncations, insertions, deletions and
specific point mutations may be required to stabilise the
fusion protein and to prevent it from being internalised. 
Also, additional "linker regions" may be required between the 
ligand and enzyme to give flexibility between the two 
components and thus allow each to be active. 

The invention also provides a nucleic acid which may be RNA
or DNA encoding such fusion protein and vectors comprising
such a nucleic acid. The nucleic acid fusion is preferably
between that of SEQ ID No. 2 for the enzyme (except where
altered to remove one or more glycosylation sites such as in
SEQ ID No. 4), and SEQ ID No. 6 for the ligand or fragments
thereof encoding the above mentioned variants of
carboxypeptidase. The vector may be an expression vector,
wherein said nucleic acid is operably linked to a promoter
compatible with a host cell for expression of the fusion
protein. The invention thus also provides a host cell which
contains an expression vector of the invention. The host
cell may be bacterial (e.g. E. coli), insect, yeast or
mammalian (e.g. hamster or human).



Host cells of the invention may be used in a method of making 
a fusion protein or fragment thereof as defined above which
comprises culturing the host cell under conditions in which
said enzyme or fragment thereof is expressed, and recovering
the enzyme or fragment thereof in substantially isolated
form. Polyhistidine residues in conjunction with nickel-
affinity based systems may be used for this purpose.



C. Other vector components 

In the system according to the invention the fusion may be 
linked to a signal sequence which directs the fusion to be
exported from the cells. This will usually be a mammalian
signal sequence or a derivative thereof which retains the 
ability to direct the enzyme to the cell surface. Some 
ligands have a naturally occurring signal sequence, which may 
also be used. For example the VEGF signal sequence may be 
employed for this purpose. If the fusion does have such a 
signal sequence, it can be replaced by another signal 
sequence where this is desirable or appropriate. Suitable 
signal sequences include those found in transmembrane 
receptor tyrosine kinases such as the c-erbB2 (HER2/neu) 
signal sequence or variants thereof which retain the ability 
to direct expression of the enzyme at the cell surface. The 
c-erbB2 signal sequence can be obtained by reference to 
Coussens et al (1985) Science 230: 1132-1139.

The experiments described in the Examples herein may be used 
to determine variants of this, or other signal sequences, for 
their ability to express the enzyme at the cell surface. The 
variants may be produced using standard techniques known per
se in molecular biology, e.g. site-directed mutagenesis of
a vector containing the signal sequence. 

Further suitable signal sequences include those which may be 
found in the review by von Heijne (1985) J. Mol. Biol. 184:
99-105.

Vectors encoding the ligand and enzyme, together with, when 
required, a signal sequence may be made using recombinant DNA 
techniques known per se in the art. The sequences encoding 
the ligand, enzyme and signal sequence may be constructed by
splicing synthetic or recombinant nucleic acid sequences 
together, or modifying existing sequences by techniques such 
as site directed mutagenesis. Reference may be made to 
"Molecular Cloning" by Sambrook et al (1989, Cold Spring 
Harbor) for discussion of standard recombinant DNA
techniques.

D. Promoters

The enzyme will be expressed in the vector using a promoter 
capable of being expressed in the cell in which the vector is 
to be expressed. The promoter will be operably linked to the 
sequences encoding the enzyme and its associated sequences.
Suitable host cells include those mentioned above, and
promoters from such host cells may be used. Viral promoters
operable in such host cells may also be used. 

E. Prodrugs 

The prodrug for use in the system will be selected to be
compatible with the enzyme, i.e. such that the enzyme will be
capable of converting the prodrug into an active drug.
Desirably, the prodrug will be at least one order of
magnitude less toxic to the patient being treated than the
active drug. Preferably, the active drug will be several,
e.g. 2, 3, 4 or more, orders of magnitude more toxic.
Suitable prodrugs include nitrogen mustard
prodrugs and other compounds such as those described in
WO88/07378, WO89/10140, WO90/02729, WO91/03460, EP-A-540 263,
WO94/02450, WO95/02420 or WO95/03830 which are incorporated
herein by reference.
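
The toxicity differential between prodrug and active drug can be
expressed in orders of magnitude as sketched below. The sketch
assumes, for illustration only, that toxicity is summarised by an
IC50 value (a lower IC50 meaning greater toxicity); the text does
not prescribe a particular measure, and the function names and
values are hypothetical.

```python
import math


def toxicity_margin_orders(ic50_prodrug: float, ic50_drug: float) -> float:
    """Orders of magnitude by which the drug is more toxic than the prodrug.

    Assumption: toxicity is inversely related to IC50, so the margin is
    log10(IC50 of prodrug / IC50 of active drug).
    """
    if ic50_prodrug <= 0 or ic50_drug <= 0:
        raise ValueError("IC50 values must be positive")
    return math.log10(ic50_prodrug / ic50_drug)


def meets_minimum_margin(ic50_prodrug: float, ic50_drug: float,
                         minimum_orders: float = 1.0) -> bool:
    """True if the prodrug is at least `minimum_orders` less toxic."""
    return toxicity_margin_orders(ic50_prodrug, ic50_drug) >= minimum_orders


# Hypothetical IC50 values in micromolar, for illustration only.
margin = toxicity_margin_orders(100.0, 1.0)
acceptable = meets_minimum_margin(1000.0, 1.0)
too_close = meets_minimum_margin(5.0, 1.0)
```

A 100-fold difference in IC50 corresponds to a margin of two
orders of magnitude, whereas a 5-fold difference falls short of
the one order of magnitude stated as desirable.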

E(i) - Nitrogen mustard prodrugs 

Nitrogen mustard prodrugs include compounds of the formula: 

M-Ar-CONH-R 

where Ar represents an optionally substituted aromatic
ring system, R-NH is the residue of an α-amino acid R-NH2 or
oligopeptide R-NH2 and contains at least one carboxylic acid
group, and M represents a nitrogen mustard group. 

The residue of the amino acid R-NH is preferably the residue 
of glutamic acid. It is disclosed in WO88/07378 that the
enzyme carboxypeptidase G2 is capable of removing the
glutamic acid moiety from compounds of the type shown above,
and the removal of the glutamic acid moiety results in the
production of an active nitrogen mustard drug.

Thus nitrogen mustard prodrugs of use in the invention
include the prodrugs of generic formula I of WO94/02450 and
salts thereof, and in particular those of formula (I):




(I) 



wherein R1 and R2 each independently represent chlorine,
bromine, iodine, OSO2Me, OSO2phenyl (wherein phenyl is
optionally substituted with 1, 2, 3, 4 or 5 substituents
independently selected from C1-4 alkyl, halogen, -CN or NO2);
R1a and R2a each independently represents hydrogen, C1-4 alkyl
or C1-4 haloalkyl;
R3 and R4 each independently represents hydrogen, C1-4 alkyl or
C1-4 haloalkyl;
n is an integer from 0 to 4;
each R5 independently represents hydrogen, C1-4 alkyl
optionally containing one double bond or one triple bond, C1-4
alkoxy, halogen, cyano, -NH2, -CONR7R8 (wherein R7 and R8 are
independently hydrogen, C1-6 alkyl or C3-6 cycloalkyl) or two
adjacent R5 groups together represent
a) C4 alkylene optionally having one double bond;
b) C3 alkylene; or
c) -CH=CH-CH=CH-, -CH=CH-CH2- or -CH2-CH=CH- each optionally
substituted with 1, 2, 3 or 4 substituents said substituents
each independently selected from the group consisting of C1-4
alkyl, C1-4 alkoxy, halogen, cyano and nitro;
X is a group -C(O)-, -O-C(O)-, -NH-C(O)- or -CH2-C(O)-; and
Z is a group -CH2-T-C(O)-OR6 where T is CH2, -O-, -S-, -(SO)-
or -(SO2)-, and R6 is hydrogen, C1-6 alkyl, C3-6 cycloalkyl,
amino, mono- or di-C1-6 alkylamino or mono- or di-C3-6
cycloalkylamino, provided that when R6 is hydrogen T is -CH2-; and
physiologically acceptable derivatives, including salts, of
the compounds of formula (I).

Halogen includes fluorine, chlorine, bromine and iodine. 
Preferred values for the groups R1a and R2a are methyl and
hydrogen, especially hydrogen. Preferred values for the
groups R3 and R4 are hydrogen, methyl and trifluoromethyl,
especially hydrogen. Preferred values for the groups R1 and
R2 are I, Br, Cl, OSO2Me and OSO2phenyl wherein phenyl is
substituted with one or two substituents in the 2 and/or 4
positions. I, Cl and OSO2Me are especially preferred.

Preferred values for R5 when n is an integer from 1 to 4 are
fluorine, chlorine, methyl, -CONH2 and cyano. Preferably, n is
0, 1 or 2. When n is 1 or 2 it is preferred that R5 is
fluorine at the 3 and/or 5 positions of the ring. The group
X is preferably -C(O)-, -O-C(O)- or -NH-C(O)-. Z is
preferably a group -CH2CH2-COOH.

Preferred specific compounds include: 

N-4-[(2-chloroethyl)(2-mesyloxyethyl)amino]benzoyl-L-glutamic
acid (referred to below as "CMDA") and salts thereof;
N-(4-[bis(2-chloroethyl)amino]-3-fluorophenylcarbamoyl)-L-
glutamic acid and salts thereof;
N-(4-[bis(2-chloroethyl)amino]phenylcarbamoyl)-L-glutamic
acid and salts thereof;
N-(4-[bis(2-chloroethyl)amino]phenoxycarbonyl)-L-glutamic
acid and salts thereof; and
N-(4-[bis(2-iodoethyl)amino]phenoxycarbonyl)-L-glutamic acid
(referred to below as "prodrug 2") and salts thereof.

Particular sub-groups of the compounds of the present 
invention of interest may be obtained by taking any one of 
the above mentioned particular or generic definitions for
R1-R4, R5, X or W either singly or in combination with any
other particular or generic definition for R1-R4, R5, X or W.

compounds of the formula (II):




wherein R1, R2, R5, n and Z are as defined for compounds of
the formula (I) above;
m is an integer from 0 to 4;
Z1 and Z2 are each independently -O- or -NH-; and
R9 is hydrogen, t-butyl or allyl;

and physiologically acceptable derivatives of the compound of
formula (II). Preferred values of R1, R2, R5, n and Z are as
defined above for compounds of the formula (I). Preferred
values of m are 0, 1 or 2 as defined for n above. R9 is
preferably hydrogen, but can be protected especially during
synthesis by groups such as allyl or t-butyl.

These prodrugs can be activated at the site of a tumour by a
carboxypeptidase enzyme, for example, CPG2 as disclosed in
WO88/07378 or WO94/02450.

Nitrogen mustard prodrugs of the formula (III):




(III)



wherein R1, R2, R5, Z1, n, and m are as defined for compounds
of the formula (II), and physiologically acceptable
derivatives thereof, may also be used in the present
invention.

These prodrugs can be activated at the site of a tumour by a
nitroreductase enzyme, for example, as disclosed in
WO93/08288.

Usually to ensure enzyme activity a cofactor such as a riboside
or a ribotide of nicotinic acid or nicotinamide will be
required and may be administered with the prodrug.

Compounds of the formulae (II) and (III) may be made using 
reactions and methods known per se in the art of chemistry, 
and also by reference to GB patent application 9501052.6 and 
PCT/GB96/ filed 19 January 1996 which claims priority
from it. The following methods are of particular use:

A: Compounds of formula (II) where Z1 is -O-:

(i) Compounds of the formula (II) in which Z1 is -O- may be made
by reacting a nitrogen mustard of formula (IV)




(IV)

where R1, R2, R5 and n are as defined above and Z4 is -O-,
with a linker of formula (V)




where R5, m, Z2, R9 and Z3 are as defined above, and Q is
hydrogen or a leaving group. This reaction may be done in
aprotic solvents in the presence of a base, for example DMF
and triethylamine.

Preferred leaving groups Q include a succinimidyl group, a
4-nitrophenyl carbonate group, a pentafluorophenyl carbonate
and a tetrachloroethyl group CH(Cl)CCl3.

(ii) Compounds of the formula (IV) may be made starting from 
4-nitrophenol optionally substituted with the group(s) R5(n)
(as defined above). The phenolic group is protected as an
adamantanyloxycarbonyl derivative (by reacting the starting
materials with adamantanyl-fluoroformate and triethylamine in
THF at rt). The protected 4-nitrophenyl carbonate is reduced
to the corresponding amine by hydrogen transfer in ethanol
using ammonium formate and Pd/C 10% as catalyst at room
temperature. The amine is then hydroxyethylated with ethylene
oxide in AcOH at 20°C and then reacted to the desired
nitrogen mustard. Reference may be made to EP-A-433 360 or
EP-A-490970 for suitable conditions. The compounds may be
purified by column chromatography. Deprotection to remove
the adamantyl group may be carried out in trifluoroacetic
acid.

(iii) Alternatively, the nitrogen mustard of formula (IV) may 
be activated as a chloroformate by treatment with phosgene or
triphosgene in an aprotic solvent and triethylamine followed
by coupling with a compound of formula (VI):




(VI) 



where R5, m, Z2, R9 and Z1 are as defined above. This may be
carried out in THF or other aprotic solvents in the presence
of a base (for example triethylamine or pyridine).

(iv) A further alternative route of synthesis of
compounds of the formula (II) in which Z1 is -O- involves
direct coupling of 4-nitrophenol optionally substituted with
the group(s) R5(n) (as defined above) with the compound of the
formula (V) or by reaction of the said optionally substituted
4-nitrophenol compound chloroformate with the compound of
formula (V), followed in each case by the reaction described
above to convert the nitro group, via an amine, to a mustard
group.

B: Compounds of formula (II) where Z1 is -NH-:

(i) Compounds of the formula (II) in which Z1 is -NH-
may be made by reaction of a compound of formula (IV) in
which Z4 is -NH- with a linker of the formula (V) in aprotic
solvents and in the presence of a base. Compounds of the
formula (IV) in which Z4 is -NH- may be made from a
1-halo-4-nitrobenzyl compound, optionally substituted with
the group(s) R5(n) (as defined above). This is converted to the
corresponding 1-bis-hydroxyethylamino-4-nitro-benzyl compound
by reaction with diethanolamine with heat and the resulting
product purified by column chromatography. The corresponding
4-nitro nitrogen mustard may be made by for example
mesylation using mesyl chloride in pyridine and subsequent
reaction to other halo mustards, e.g. bromo or iodo mustards
if required. The 4-nitro group may be reduced by hydrogen
transfer in ethanol using ammonium formate and a Pd/C 10%
catalyst at 20°C.

(ii) Alternatively the 1-bis-hydroxyethylamino-4-nitrobenzyl
compound mentioned above can be reduced using
ammonium formate and Pd/C 10% as catalyst in ethanol at 20°C
to provide the corresponding phenylene-diamino derivative.
This derivative can be converted into the corresponding
4-amino nitrogen mustard as described in the above paragraph,
e.g. initially by reaction with mesyl chloride.

C: Compounds of formula (III):

(i) Compounds of the formula (III) may be obtained by
coupling nitrogen mustard phenol compounds described in
section A(i) above with 4-nitrobenzyl chloroformate
optionally substituted with the group(s) R5(m) (as defined
above) in the presence or absence of triethylamine at 20°C.

(ii) Alternatively aniline nitrogen mustards as
described in section B(ii) above may be used with the
chloroformate as described in section C(i) above.

D: Compounds of the formula (V) in which Z2 is -NH-:

(i) Compounds of the formula (V) in which Z2 is -NH-
may be made from a 4-nitro benzylic alcohol optionally
substituted with the group(s) R5(n) (as defined above). The
hydroxyl function is protected as a pyranyl- or
t-butyl-dimethylsilyl (TBDMSi)-ether by treatment at 20°C
with 3,4-2H-dihydropyran and pyridinium-p-toluenesulfonate
(PPTS) in an aprotic solvent or with TBDMSi chloride and
imidazole in dimethylformamide (DMAC), respectively. The
intermediate thus obtained is reduced to the corresponding
amine by hydrogen transfer in ethanol using ammonium formate
and Pd/C 10% as catalyst at 20°C. This amine is converted to
a glutamyl ester intermediate of formula (VII):




(VII) 



where R5, m, R9 and Z1 are as defined above, Z2 is -NH- and Pr
is the pyranyl- or t-butyl-dimethylsilyl (TBDMSi)-ether
protecting group. This may be done by treating the amine
with triphosgene and triethylamine in toluene at 60°C to
provide the corresponding isocyanate, which is treated with a
glutamate derivative of formula R9-C(O)-CH(NH2)-Z1 where R9
and Z are as defined above. Alternatively the corresponding
glutamyl-isocyanate obtained from the corresponding glutamate
by treatment with triphosgene and triethylamine in toluene at
-78°C may be reacted with the amine in a one pot procedure.

WO 97/26918    PCT/GB97/00221

(ii) The compound of formula (VII) is deprotected to remove
the TBDMSi or pyranyl groups by treatment with mild acidic
media (AcOH, THF and H2O or PPTS, EtOH, 55°C). This yields
a compound of formula (VII) in which Pr is hydrogen.
Compounds of the formula (V) in which Q is a leaving group
may be prepared using standard reactions known in the art.

(iii) For example where Q is a succinimidyl group the
compound of formula (VII) where Pr is hydrogen may be treated
with disuccinimidyl-carbonate and triethylamine in
acetonitrile. Where a 4-nitrophenyl carbonate group is
desired, treatment with 4-nitrophenyl chloroformate and
triethylamine in THF may be used. A pentafluorophenyl
carbonate may be added by in situ phosgenation of
pentafluorophenol followed by coupling to the linker of
formula (VII) in which Pr is hydrogen.

E: Compounds of the formula (V) in which Z2 is -O-:

(i) The starting materials for the linkers possessing a
carbamic bond are unsubstituted or substituted (with the
group(s) R5(n) (as defined above)) 4-hydroxy-benzylic
alcohols. These types of linkers may require an extra electron
withdrawing group on the aromatic nucleus in order to undergo
1,4-elimination. The 4-hydroxy group is specifically
protected as an acetate by treating the starting material
with acetyl-v-triazolo-[4,5-b]pyridine, 1N NaOH in THF at
20°C. The alcohol function of the acetate is further
protected as pyranyl- or TBDMSi-ether by the procedures
described in section D above. The acetate function is then
deprotected to restore the 4-hydroxy group in NaHCO3 aq. MeOH
at 20°C. The resulting phenol compounds are reacted in a one
pot procedure with a protected glutamyl-isocyanate as
described in section D(i) above. This yields a compound of
the formula of (VII) as shown above in which Z2 is -O- and Pr
is the pyranyl- or t-butyl-dimethylsilyl (TBDMSi)-ether
protecting group.

(ii) Deprotection of this compound yields a compound of the
formula (VII) in which Pr is hydrogen. This may be converted
to compounds of the formula (V) by methods analogous to those
described in sections D(ii) and (iii) above.

F: Alternative synthesis of compounds of formula (IV):

Compounds of the formula (IV) in which Q is hydrogen, fluoro,
chloro, bromo or -O-(N-succinimide) may also be obtained by
reference to WO95/02420 or WO95/03830.

E(ii). Other prodrugs

Other compounds which may be used as prodrugs include p-
nitrobenzyloxycarbonyl derivatives of cytotoxic compounds.
Such compounds can be used in conjunction with a
nitroreductase enzyme. It is believed that the
nitroreductase enzyme converts the nitro group of the prodrug
into a hydroxylamino or amino group, which results in the p-
nitrobenzyloxycarbonyl moiety becoming activated and then
self-immolating. This releases the active drug. These
compounds include a compound of formula:

The nitroreductase enzyme of WO93/08288 requires a co-factor
such as NADH or NADPH, and this may optionally be supplied as
an additional component in the system of the invention.

Examples of other compounds described in the above references
include prodrugs of actinomycin D, doxorubicin, daunomycin
and mitomycin C. Prodrugs of the foregoing references are
converted to active drugs by either nitroreductase or CPG2,
although they may be modified to comprise protecting groups
cleavable by other enzymes, e.g. β-lactamase or glucuronidase.

Further prodrugs suitable for use in the invention include
those of the general formula: FTLi-(PRT)m' or salts thereof
where FTLi is a ras inhibitor such as a farnesyltransferase
inhibitor compound and PRT represents m' protecting groups
capable of being cleaved from the ras inhibitor by the action
of an enzyme, where m' is an integer from 1 to 5. Such
compounds are disclosed in WO95/03830.

Other suitable prodrugs for use in the system of the
invention include those which are derivatized with a sugar or
a β-lactam derivative. For example, suitable linkers which
may be attached to active drugs of the type described above
are:

where R' is hydrogen or acetyl and Y' is aryl such as phenyl,
benzyl or tolyl, and these may be made in an analogous
manner to the other prodrugs described above.

A further group of prodrugs are tyrphostin compounds of the
general formula: PTKi-PRTm' where PTKi is a compound with PTK
(protein tyrosine kinase) inhibitory activity, PRT is a
protecting group capable of being cleaved from the PTK
inhibitor by the action of an enzyme and m' is an integer
from 1 to 5.

Suitable tyrphostins such as the above may be obtained by the
methods disclosed in, or analogous to those of, WO95/02420,
Gazit et al 1989 and 1991, ibid, which are incorporated
herein by reference.

E(iii). Derivatives

Physiologically acceptable derivatives of prodrugs include
salts, amides, esters and salts of esters. Esters include
carboxylic acid esters in which the non-carbonyl moiety of
the ester grouping is selected from straight or branched
chain C1-6 alkyl (methyl, n-propyl, n-butyl or t-butyl); or
C3-6 cyclic alkyl (e.g. cyclohexyl). Salts include
physiologically acceptable base salts, e.g. derived from an
appropriate base, such as alkali metal (e.g. sodium),
alkaline earth metal (e.g. magnesium) salts, ammonium and NR"4
(wherein R" is C1-4 alkyl) salts. Other salts include acid
addition salts, including the hydrochloride and acetate
salts. Amides include non-substituted and mono- and di-
substituted derivatives.

F. Applications of the invention 

The system of the invention can be used in a method of
treatment of the human or animal body. Such treatment
includes a method of treating the growth of neoplastic cells
which comprises administering to a patient in need of
treatment the system of the invention. It is also possible
that the invention may be used to treat cells which are
diseased through infection of the human or animal body by
bacteria, viruses or parasites.

One suitable route of administration is by injection of the 
particles in a sterile solution. While it is possible for the 
prodrugs to be administered alone it is preferable to present 
them as pharmaceutical formulations. The formulations 
comprise a prodrug, together with one or more acceptable 
carriers thereof and optionally other therapeutic 
ingredients. The carrier or carriers must be "acceptable" in 
the sense of being compatible with the other ingredients of 
the formulation and not deleterious to the recipients 
thereof, for example, liposomes. Suitable liposomes include,
for example, those comprising the positively charged lipid
N-[1-(2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA),
those comprising dioleoylphosphatidylethanolamine (DOPE), and
those comprising 3β[N-(N',N'-dimethylaminoethane)-
carbamoyl]cholesterol (DC-Chol).

Formulations suitable for parenteral or intramuscular
administration include aqueous and non-aqueous sterile
injection solutions which may contain anti-oxidants, buffers,
bacteriostats, bactericidal antibiotics and solutes which
render the formulation isotonic with the blood of the
intended recipient; and aqueous and non-aqueous sterile
suspensions which may include suspending agents and
thickening agents, and liposomes or other microparticulate
systems which are designed to target the compound to blood
components or one or more organs. The formulations may be
presented in unit-dose or multi-dose containers, for example
sealed ampoules and vials, and may be stored in a freeze-
dried (lyophilized) condition requiring only the addition of
the sterile liquid carrier, for example water for
injections, immediately prior to use. Injection solutions
and suspensions may be prepared extemporaneously from sterile
powders, granules and tablets of the kind previously
described.

It should be understood that in addition to the ingredients 
particularly mentioned above the formulations may include 
other agents conventional in the art having regard to the 
type of formulation in question. Of the possible 
formulations, sterile pyrogen-free aqueous and non-aqueous 
solutions are preferred. 

The doses may be administered sequentially, e.g. at daily,
weekly or monthly intervals, or in response to a specific
need of the patient. Preferred routes of administration are
oral delivery and injection, typically parenteral or
intramuscular injection or intratumoural injection.

In using the system of the present invention the prodrug will
usually be administered following administration of the
ligand-enzyme. Typically, the protein will be administered to
the patient and then the uptake of the protein monitored, for
example by recovery and analysis of a biopsy sample of
targeted tissue or by injecting trace-labelled ligand-enzyme
protein.

The exact dosage regime will, of course, need to be
determined by individual clinicians for individual patients
and this, in turn, will be controlled by the exact nature of
the prodrug and the cytotoxic agent to be released from the
prodrug but some general guidance can be given. Chemotherapy
of this type will normally involve parenteral administration
of both the prodrug and modified protein and administration
by the intravenous route is frequently found to be the most
practical. For glioblastoma the route is often
intratumoural. A typical dosage range of prodrug generally
will be in the range of from about 1 to 150 mg per kg per
patient per day, which may be administered in single or
multiple doses. Preferably the dose range will be in the
range from about 10 to 75, e.g. from about 10 to 40, mg per
kg per patient per day. Other doses may be used according to
the condition of the patient and other factors at the
discretion of the physician.
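
By way of illustration only, the per-kilogram ranges quoted above can be turned into per-patient daily amounts by simple multiplication. The sketch below is arithmetic, not clinical guidance; the function names and the 70 kg body weight are hypothetical and do not come from the specification.

```python
def daily_dose_range_mg(body_weight_kg, low_mg_per_kg=1.0, high_mg_per_kg=150.0):
    """Return (lowest, highest) total daily prodrug amount in mg.

    Defaults are the broad range quoted in the text (about 1 to 150 mg/kg/day).
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg)


def split_daily_dose(total_mg, n_doses):
    """Split one daily amount evenly into n_doses administrations."""
    if n_doses < 1:
        raise ValueError("at least one dose is required")
    return [total_mg / n_doses] * n_doses


# Hypothetical 70 kg patient (an assumption for illustration).
broad = daily_dose_range_mg(70)              # about 1 to 150 mg/kg/day
preferred = daily_dose_range_mg(70, 10, 75)  # about 10 to 75 mg/kg/day
narrow = daily_dose_range_mg(70, 10, 40)     # about 10 to 40 mg/kg/day
```

The even split into multiple doses is one possible reading of "single or multiple doses"; the text leaves the actual schedule to the physician.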

Tumours which may be treated using the system of the present
invention include any tumours capable of being treated by a
LIDEPT system and thus are not limited to any one particular
class of tumours. Particularly suitable tumour types include
breast, colorectal and ovarian tumours, as well as
pancreatic, melanoma, glioblastoma, hepatoma, small cell
lung, non-small cell lung, muscle and prostate tumours. In
the case of a LIDEPT system which uses VEGF, these and other
types of solid tumours which comprise an actively growing
vasculature are all candidates for treatment.

The system of the invention may also be used to treat
infectious diseases, for example, and any other condition
which requires eradication of a population of cells.

It will be understood that where treatment of tumours is
concerned, treatment includes any measure taken by the
physician to alleviate the effect of the tumour on a patient.
Thus, although complete remission of the tumour is a
desirable goal, effective treatment will also include any
measures capable of achieving partial remission of the tumour
as well as a slowing down in the rate of growth of a tumour
including metastases. Such measures can be effective in
prolonging and/or enhancing the quality of life and relieving
the symptoms of the disease.

The following Examples illustrate the invention. 
EXAMPLE 1

In order to demonstrate the invention, a fusion protein
between CPG2 and VEGF was prepared. This fusion protein is
formed between portions of c-ErbB2 (signal peptide), in order
to direct expression outside of mammalian cells, CPG2
containing specific mutations which block glycosylation
(N222, 264 and 272 > Q, as described in PCT/GB95/01782) and
VEGF. The structure of the fusion protein is:
c-ErbB2 (amino acids 1-27) : Gly Ser : CPG2 (amino acids 23-
415) : Glu Phe Gly Gly Gly Gly Gly Thr Ala : VEGF165 (amino
acids 28-191).
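
As a reading aid, the segment structure given above can be tallied to obtain the length of the primary translation product. The sketch below only counts residues from the stated ranges; the class and variable names are hypothetical, and no molecular weight is inferred.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    name: str
    n_residues: int


def span(first, last):
    """Number of residues in an inclusive amino acid range."""
    return last - first + 1


# Segments in N- to C-terminal order, as listed in the text.
FUSION = [
    Segment("c-ErbB2 signal peptide (aa 1-27)", span(1, 27)),
    Segment("Gly Ser", 2),
    Segment("CPG2 (aa 23-415)", span(23, 415)),
    Segment("Glu Phe Gly Gly Gly Gly Gly Thr Ala", 9),
    Segment("VEGF165 (aa 28-191)", span(28, 191)),
]

total_residues = sum(seg.n_residues for seg in FUSION)

# Length after removal of the signal peptide, assuming (not stated in the
# text) that cleavage occurs exactly after c-ErbB2 residue 27.
without_signal = total_residues - FUSION[0].n_residues
```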

The DNA and protein sequences of CPG2 are shown in SEQ ID No.
1 and 2 respectively, and this sequence was used except for
the altered glycosylation sites as shown above. The sequence
of VEGF can be found in Conn et al Proc. Natl. Acad. Sci.
(1990) 87; 2628.

This fusion was cloned into the mammalian expression vector
pEF Plink 2 (Marais et al, (1995) EMBO J. 14, 3136-3145) and
transfected into COS-7 cells. In order to assess expression
of the fusion protein, the culture medium from the
transfected cells was analysed by immunoprecipitation, CPG2
enzyme activity and heparin binding activity.

(1) Immuno-precipitation.

1 ml of tissue culture medium was immuno-precipitated with a
CPG2 specific antiserum and the proteins eluted from the
beads. The eluted proteins were analysed by immuno-protein
blot, and shown to contain a fusion protein with an apparent
molecular weight of 60-64,000, which was not present in
control transfected cells.

(2) The supernatants were also analysed for CPG2 activity
using methotrexate as a substrate and found to contain CPG2
activity which was absent in the control transfection.

(3) Heparin-binding
1 ml tissue culture medium was mixed with heparin-agarose,
and the heparin-agarose beads were collected and washed, and
bound proteins were eluted and analysed by immuno-protein
blotting using a CPG2 antiserum. The results show that
supernatants from cells transfected with the fusion gene
contained the CPG2-VEGF fusion protein and that this was
competent to bind to heparin, whereas a fusion between CPG2
and VEGF amino acids 28-136 did not bind to heparin.

These data demonstrate:

(1) The fusion protein can be synthesised and excreted by
mammalian cells.

(2) The fusion protein contains CPG2 activity.

(3) The fusion protein is stable in tissue culture medium.

(4) The fusion protein is soluble.

(5) The fusion protein is able to bind to heparin, through
the VEGF component.

EXAMPLE 2

1. Introduction

The VEGF proteins are secreted from cells via the endoplasmic
reticulum (ER) and Golgi apparatus during which they become
proteolytically processed and glycosylated to generate the
mature proteins (Eur J. Biochem, 211, 19, 1993). There are
at least four splice variants of the gene, which encode
different isoforms of the protein. The proteins that these
genes encode are referred to by the number of amino acids
present in the mature protein with amino acid number 1
referring to the first amino acid in the mature protein. VEGF
165 is one of the isoforms of VEGF, which consists of 165
amino acids in the mature protein and which contains both a
specific receptor interacting domain and a heparin-binding
domain. By contrast, VEGF 121 is an isoform consisting of a
protein of 121 amino acids which contains the specific
receptor interacting domain, but not the heparin-binding
domain (Science, 253, 989, 1992; Cell, 72, 635, 1993). The
aim is to prepare VEGF-enzyme fusion proteins to activate a
subsequently administered prodrug which will generate a
cytotoxic agent at the tumour.

The bacterial enzyme Carboxypeptidase G2 (CPG2) is secreted
from bacterial cells and is normally located in the
periplasmic space. CPG2 catalyses the degradation of
Methotrexate (MTX) and can also be used to cleave and thereby
activate mustard prodrugs (J. Med. Chem, 33, 677, 1990).
These prodrugs, which are relatively non-toxic, are cleaved by
CPG2 into active bifunctional alkylating agents, which are
highly toxic to mammalian cells. If the signal peptide from
CPG2 is replaced with a mammalian signal peptide, and CPG2
is expressed in mammalian cells, the protein enters the
secretory pathway. However, this form of CPG2 becomes
inappropriately glycosylated on three sites, resulting in an
inactive enzyme. The activity of this mammalian expressed
form of CPG2 can be partially recovered if the three Asn
residues that are in the core of the glycosylation motif are
mutated into Gln residues. This results in a protein which
retains about 10% of the activity of the bacterial enzyme.
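
The Asn-to-Gln substitution described above can be sketched in code. The example assumes the conventional N-linked glycosylation motif Asn-X-Ser/Thr with X not Pro; the text itself only speaks of "the glycosylation motif", so that pattern is an assumption. The sequence is a hypothetical toy peptide, not CPG2, and the function names are illustrative only.

```python
import re

# Assumed N-linked sequon: Asn, any residue except Pro, then Ser or Thr.
# The lookahead lets overlapping motifs be reported individually.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein):
    """Return 1-based positions of the Asn in each candidate sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein)]


def block_glycosylation(protein):
    """Replace the Asn at the core of every candidate sequon with Gln."""
    residues = list(protein)
    for position in find_sequons(protein):
        residues[position - 1] = "Q"
    return "".join(residues)


toy_peptide = "MKNGSAANVTLLNPSWWNKTA"   # hypothetical, for illustration only
sites = find_sequons(toy_peptide)
mutant = block_glycosylation(toy_peptide)
```

In the toy peptide the Asn of the N-P-S triplet is left untouched because Pro at the middle position is excluded by the assumed motif.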

The purpose of this invention is to target CPG2 to tumours
where it will be employed to cleave and activate prodrugs
(including mustard prodrugs). When CPG2 is targeted to the
tumour, then the prodrug activation should only occur at that
locus.

2. Description of the CPG2 and VEGF genes and Proteins
Used

In considering the possible types of conjugated proteins that
may be produced, this invention employs proteins produced
from genes that are ligated so that the recombinant proteins
are produced as ready-made fusions. Since
VEGF is normally secreted from cells via the endoplasmic
reticulum and Golgi apparatus, the aim was that the fusion
proteins should pass through this secretory pathway so that
the VEGF moiety would be expressed in its mature form.
However, when CPG2 is processed through this secretory
pathway in eukaryotic cells, it is subjected to inappropriate
and inactivating glycosylation. Therefore constructions of
fusions between VEGF and the form of CPG2 which cannot be
glycosylated were made. The feasibility of fusions in which
the VEGF moiety was ligated either to the amino-terminus (N-
terminus) or to the carboxyl-terminus (C-terminus) of CPG2 was
tested. In addition, the presence or absence of the VEGF
heparin-binding domain was examined for its ability to affect
the activity of the fusion proteins. A series of fusion
proteins was therefore constructed for expression in
eukaryotic cells that would allow these parameters to be
tested.

2.1 The CPG2 gene and protein

The CPG2 gene used was modified for expression in mammalian
cells. The codons for the CPG2 signal peptide were replaced
with the signal peptide from the human c-erbB2 protein to
ensure efficient secretion of the CPG2 protein. For c-erbB2
signal sequence see Coussens et al (1985) Science 230; 1132-
1139. Furthermore, the three Asn residues at the core of the
glycosylation motifs were mutated to Gln residues to prevent
glycosylation, and also a polyhistidine tag was added to the
3' end of the gene so that the fusion proteins could be
purified by nickel-NTA affinity chromatography. This
construct is referred to as CPH6 and is represented
schematically in Figure 1A. For expression in mammalian
cells, the gene for this construct was cloned into the
mammalian expression vector pEFPlink 2. This vector was
chosen since it uses the promoter from the elongation factor
1α gene to direct expression of foreign genes in mammalian
cells and has been shown to be highly active in a variety of
cell types. The structure of the protein that the CPH6 gene
is predicted to express is shown schematically in Fig. 1A;
the complete DNA and protein sequences are given in SEQ ID
Nos. 3 and 4.

2.2 The VEGF gene and protein

The VEGF gene in the plasmid pVEGF165 (in Bluescript) encodes
the VEGF signal peptide and a splice variant in which the
mature protein consists of 165 amino acids. This encodes the
specific receptor binding domain of VEGF encompassed within
residues 1-121 of the mature protein. VEGF165 also encodes
a heparin-binding domain, which requires the region of amino
acids from 122-165 of the mature protein. In this gene there
is a single glycosylation motif located at residue 75 of the
mature protein (VEGF165) (JBC, 266, 11947, 1991). The
structure of the protein that the VEGF gene is predicted to
express is shown schematically in Fig 1B; the complete DNA
and protein sequences are given in SEQ ID Nos. 4 and 5.

3. Generation of the Fusion Proteins

In order to generate the various genes that were to be tested
in the LIDEPT approach, the CPH6 and VEGF165 genes were used.
Mutations to produce the fusions were created by PCR directed
mutagenesis, which has the advantage that its precise nature
allows defined fusions to be generated at exact
positions. The PCR techniques used are standard and can be
found in any basic text, although an example of such a
protocol is shown below.

DNA samples were heated through thermal cycles in a
proprietary heat cycling instrument, in the presence of
mutagenic primers, nucleotides and appropriate buffers to
generate DNA samples that can then be cloned into appropriate
recipient vectors. Typical heating cycles are presented below
and the enzyme Taq polymerase was used to generate the DNA.

95°C 60 sec
55°C 30 sec
72°C 30 sec
25 cycles were generally employed.
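
The typical cycling conditions above can be written as a small program description, which makes the total incubation time explicit. This is a minimal sketch: the class and function names are hypothetical, ramp times between temperatures are ignored, and no initial denaturation or final extension step is assumed because the text does not mention one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int    # block temperature in degrees Celsius
    seconds: int   # hold time at that temperature


# One cycle, exactly as tabulated in the text.
CYCLE = (Step(95, 60), Step(55, 30), Step(72, 30))
N_CYCLES = 25


def total_hold_seconds(cycle, n_cycles):
    """Total time spent at the programmed temperatures (no ramping)."""
    return n_cycles * sum(step.seconds for step in cycle)


def expand_program(cycle, n_cycles):
    """List every hold as (cycle_number, temp_c, seconds)."""
    return [
        (number, step.temp_c, step.seconds)
        for number in range(1, n_cycles + 1)
        for step in cycle
    ]


program = expand_program(CYCLE, N_CYCLES)
hold_time = total_hold_seconds(CYCLE, N_CYCLES)
```

With the values given, the holds alone account for 3000 seconds (50 minutes); the actual run time on an instrument would be longer once ramping is included.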

The products from these reactions were cloned into recipient
vectors and analysis by dideoxy-sequencing techniques was
used to verify the integrity of the fragments generated.

Fusion proteins were tested in which the VEGF and CPG2
proteins were fused to each other in both orientations and in
which the VEGF moiety contains or lacks the heparin-binding
domain. Four fusions were created, with CPG2 fused either to
the N-, or to the C-terminus of VEGF, where the VEGF either
contained or lacked the heparin-binding domain. These
constructs are represented schematically in Fig 1 and are
described in detail below.
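
The two-by-two design described above (orientation of the fusion, and presence or absence of the heparin-binding domain) can be laid out as a lookup table. The construct names are those used in this Example; the dictionary layout, key strings and helper function are hypothetical conveniences for illustration.

```python
from itertools import product

ORIENTATIONS = ("VEGF-CPG2", "CPG2-VEGF")   # N-terminal moiety listed first
HEPARIN_BINDING = (True, False)

# Construct names as used in this Example, keyed by the two design variables.
CONSTRUCTS = {
    ("VEGF-CPG2", True): "V161CPH6",
    ("VEGF-CPG2", False): "V115CPH6",
    ("CPG2-VEGF", True): "CPV165",
    ("CPG2-VEGF", False): "CPV109",
}


def signal_peptide(orientation):
    """Signal peptide directing secretion: that of the N-terminal moiety."""
    return "VEGF" if orientation == "VEGF-CPG2" else "c-erbB2"


design = [
    (CONSTRUCTS[(orientation, hbd)], orientation, hbd, signal_peptide(orientation))
    for orientation, hbd in product(ORIENTATIONS, HEPARIN_BINDING)
]
```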

3.1 V161CPH6

In this clone, the predicted protein product would be a
fusion in which the CPG2 moiety was fused to the C-terminus
of amino acids 1-161 of mature VEGF. Since the VEGF moiety is
located at the N-terminus of the fusion, its own signal
peptide will direct the protein to the secretion pathway of
the cell. The c-erbB2 signal peptide was removed from the
CPG2 gene. This fusion would therefore contain both the
receptor-binding domain and also the heparin-binding domain
of VEGF. It also contains a C-terminal polyhistidine tag for
purification purposes.

The cloning of this fusion was performed as follows. The PCR
was used in conjunction with oligonucleotide primers 1 and 2
(Table 1) and plasmid pVEGF165 to introduce silent mutations
within the VEGF open reading frame that would destroy the
internal NcoI site located within that gene. This was
performed to facilitate cloning at the later stages.
Next, a PCR fragment generated by primers 3 and 4 (Table
1) in conjunction with the altered VEGF gene was digested
with the restriction endonucleases NcoI and BamHI and the
resulting fragment was cloned into those sites in plasmid
pEFCPH6. This strategy results in a gene that would encode
the VEGF signal peptide, VEGF amino acids 1-161, fused to
CPG2 amino acids 23-415, and containing a polyhistidine tag
at the C-terminus for purification purposes. This construct
is referred to as V161CPH6 and the protein is represented
schematically in Fig 1C. The sequence of the V161CPH6 gene and
the protein it is predicted to encode are given in SEQ ID
Nos. 7 and 8.

3.2 V115CPH6

In this clone, the CPG2 is fused to the C-terminus of VEGF.
The portion of VEGF used for this clone is the first 115
amino acids of the mature protein and therefore does not
contain the heparin-binding domain. As with V161CPH6 the
signal peptide of VEGF will direct the secretion of the
fusion protein and therefore the c-erbB2 signal peptide was
removed.

In cloning this fusion protein, the internal NcoI site within
VEGF was destroyed with PCR and oligonucleotide primers 1 and
2 (Table 1) as described in Section 3.1, in order to
facilitate cloning at the later stages. A PCR was
performed in conjunction with oligonucleotides 3 and 5 (Table 1)
and the altered VEGF gene and the product was digested with
the restriction endonucleases NcoI and BamHI. The resulting
fragment was cloned into these sites in plasmid pEFCPH6. This
strategy produces a gene which would encode the signal
peptide from VEGF and the first 115 amino acids of the mature
protein fused to amino acids 22-415 of CPG2 and the
polyhistidine tag. This construct is referred to as V115CPH6
and the protein is represented schematically in Fig 1D. The
sequence of the V115CPH6 gene, and the protein that it is
predicted to encode, are given in SEQ ID Nos. 9 and 10.

3.3 CPV165

In this clone, CPG2 is fused to the N-terminus of VEGF. In
this arrangement, the c-erbB2 signal peptide is used to
direct secretion of the protein from the mammalian cells and
so the VEGF signal peptide was removed. This fusion contains
both the VEGF receptor binding domain and the heparin-binding
domain. However, in the cloning of this gene, the
polyhistidine tag was lost from the C-terminus of CPG2.

The cloning of this fusion was performed as follows. Standard
cloning techniques employing oligonucleotides 6 and 7 (Table
1) were used to replace the polyhistidine tag at the 3' end
of the CPH6 gene with a peptide spacer of 6 amino acids and
also to create a unique KpnI site at that position. The
structure of the linker is (Gly)5Thr and was designed to
give flexibility to the region between the two protein
moieties. A PCR was performed with oligonucleotides 8 and 9
(Table 1) in conjunction with pVEGF165 to generate an altered
VEGF gene. This PCR product was digested with the restriction
endonucleases KpnI and XbaI and cloned into the freshly
created KpnI site and XbaI sites of pEFCPH6. This
arrangement results in a gene whose structure is the signal
peptide from c-erbB2 (amino acids 1-27-CHECK) fused to
CPG2(Q)3 (amino acids 22-415) fused, via the spacer, to codons
1-165 of mature VEGF. This construct is referred to as CPV165
and the protein is represented schematically in Fig 1E. The
sequence of the CPV165 gene, and the protein it is predicted
to encode, are given in SEQ ID Nos. 11 and 12.

3.4 CPV109

During the cloning of CPV165 it was noticed that one of the
clones that was generated contained a mutation at codon 110
of the mature VEGF sequence. This converts an AGA (Arg) codon
into a TGA (stop) codon and thus results in a truncation of
VEGF at that position. Since this mutation fortuitously
removes the heparin-binding domain of VEGF, we chose to
characterise this fusion protein further to enable us to
assess the results of having VEGF fused to the C-terminus of
CPG2 in which the heparin-binding domain of VEGF was absent.
This construct is referred to as CPV109 and the protein is
represented schematically in Fig 1F. The sequence of the
CPV109 gene and the protein that it is predicted to encode are
given in SEQ ID Nos. 13 and 14.
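
The effect of the AGA to TGA change at codon 110 can be illustrated with a short sketch showing why the resulting VEGF moiety ends at residue 109. The codon list below is a hypothetical placeholder (alanine codons with a single AGA at position 110), not the VEGF sequence, and the function names are illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def mutate_codon(codons, codon_number, expected, replacement):
    """Return a copy of codons with the 1-based codon_number replaced.

    Raises ValueError if the existing codon is not the expected one.
    """
    index = codon_number - 1
    if codons[index] != expected:
        raise ValueError(f"codon {codon_number} is {codons[index]}, not {expected}")
    mutated = list(codons)
    mutated[index] = replacement
    return mutated


def translated_length(codons):
    """Number of residues translated before the first in-frame stop codon."""
    for count, codon in enumerate(codons):
        if codon in STOP_CODONS:
            return count
    return len(codons)


# Placeholder for the 165 codons of mature VEGF plus its natural stop codon.
wild_type = ["GCT"] * 109 + ["AGA"] + ["GCT"] * 55 + ["TGA"]
truncated = mutate_codon(wild_type, 110, "AGA", "TGA")

wild_type_length = translated_length(wild_type)
truncated_length = translated_length(truncated)
```

Because residues 122-165 are stated above to be required for heparin binding, a product ending at residue 109 necessarily lacks that region.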

Figure 1. Schematic representation of CPG2, VEGF and the
fusion proteins

Each of the proteins is represented as a bar which has
variable shading to represent the different sections of the
fusions. The proteins are all depicted with the amino-
terminus to the left and the carboxyl-terminus to the right
(N and C respectively). The fusion junctions between the
proteins are represented by open boxes. In the cases of
CPV165, CPV109, CPV165H6 and CPV109H6, the peptide linker between
the CPG2 and VEGF moieties is represented by a stippled box
(L). Two forms of signal peptides have been employed to
target the proteins for the secretory pathway. One is the
signal peptide from the mammalian tyrosine kinase receptor
c-erbB2 (SPerb) and the other is the signal peptide from the
VEGF gene (SPV). Numbers in brackets indicate the amino
acids present in either the CPG2 or VEGF motif for each
fusion. The position of the amino acids that are required
for heparin-binding by VEGF are indicated by the hatched box.
The position of glycosylation in the VEGF fragments is
indicated by . The three sites in CPG2 which must be
mutated to prevent glycosylation are indicated by o. Where
the fusions express a polyhistidine tag, this is indicated
(H6). The clones represented are: A. CPH6; B. VEGF165; C.
V161CPH6; D. V115CPH6; E. CPV165; F. CPV109; G. CPV161H6; H. CPV109H6.

Figure 2. Expression and heparin binding of CPH6 and the
fusion proteins

The tissue culture supernatants from the transfected COS
cells as described in Table 2 were subjected to Western blot
analysis and analysis for the ability to bind to heparin.

A. Immunoprecipitation and immunoprotein blot analysis

For each sample, 500µL of the conditioned medium from cells
transfected with the relevant expression vectors was made up
to 0.1% v/v with Triton X-100 and then incubated with
a rabbit polyclonal antiserum specific for CPG2 (Marais et al
1996) immobilised on Protein G-Sepharose beads (Pharmacia).
The complexes were formed for 6 hours at room temperature and
then the beads were washed three times with 500µL of PBS
containing 0.1% Triton X-100. The bound proteins were eluted
from the beads, resolved on an 8% SDS-gel and analysed by
immunoprotein blotting following standard techniques and
using the CPG2 specific antiserum. The protein expressed in
each sample is indicated above the appropriate lanes and are:
lane 1, pEFCPH6; lane 2, pEFVEGF165; lane 3, pEFV161CPH6; lane
4, pEFV115CPH6; lane 5, pEFCPV109; lane 6, pEFCPV165; lane 7,
control (pEFPlink.2). The position of migration of protein
markers (x 10^-3) is shown to the right of the figure and the
positions of migration of the major products of each gene are
indicated by the arrows to the left of the figure.

B. Heparin-binding of the fusion proteins

For each sample, 500µL of the conditioned medium was made up
to 0.1% v/v with Triton X-100 and then incubated with ~35µL
of Heparin-Sepharose beads at 4°C for ~16 hours. The beads
were washed three times with 500µL of PBS containing 0.1%
Triton X-100 and the bound proteins were eluted from the
beads, resolved on an 8% SDS-gel and analysed by
immunoprotein blotting following standard techniques with the
CPG2 specific antiserum (Marais et al 1996). The protein
expressed in each sample is indicated above the appropriate
lanes and are: lane 8, pEFCPH6; lane 9, pEFVEGF165; lane 10,
pEFV161CPH6; lane 11, pEFV115CPH6; lane 12, pEFCPV109; lane 13,
pEFCPV165; lane 14, control (pEFPlink.2). The position of
migration of protein markers (x 10^-3) is shown to the right
of the figure and the positions of migration of the major
products of each gene are indicated by the arrows to the left
of the figure.

Figure 3. Purification of CPV165 by Heparin-Sepharose
chromatography

Conditioned culture medium from mammalian cells transfected
with pEFCPV165 was subjected to a purification protocol by
Heparin-Sepharose chromatography. For this protocol, 17ml of
conditioned medium harvested from COS cells 72 hours post-
transfection was passed over a 700µL Heparin-Sepharose CL 4B
column which was then washed with 7ml column buffer (100mM
Tris.HCl, 260µM ZnCl2, pH 7.3). The column was eluted
sequentially with 2.8mL column buffer containing 400mM NaCl,
2.8mL column buffer containing 800mM NaCl, 2.8mL column
buffer containing 1.2M NaCl, 2.8mL column buffer containing
1.6M NaCl and 2.8mL column buffer containing 2M NaCl at the
positions indicated by the arrows. 0.7ml fractions were
collected. The elution of bulk proteins is indicated (closed
circles) and was determined using the Bio-Rad protein
determination kit, with OD measurements at 595nm. The
elution of the CPG2 activity was determined using MTX as a
substrate as described in Table 2 and is indicated by the
open circles.

Figure 4. Purification of fusion proteins by Ni++-NTA
agarose affinity chromatography

Sf9 cells were infected with baculoviruses encoding CPH6 and
V115CPH6 and the proteins were purified by Ni++-NTA agarose
affinity chromatography as described in Table 5. Samples
from the load fraction (LOAD, lanes 1, 3) and purified
proteins (ELUATE, lanes 2, 4) were analysed by SDS-PAGE
followed by silver staining to determine their purity. The
samples for CPH6 are in lanes 1 and 2 and the samples for
V115CPH6 are in lanes 3 and 4. The positions of migration of
the purified proteins are indicated on the left of the figure
and the positions of migration of standard proteins (x 10^-3)
are indicated on the right of the figure.

Figure 5. Dimer analysis of fusion proteins

V115CPH6 (lanes 1, 3) and CPH6 (lanes 2, 4) expressed in Sf9
cells and purified by Ni++-NTA agarose affinity
chromatography as described in Table 5 were used in this
analysis. For each sample, ~50ng of protein was heated to
65°C for 5 min in the absence (non-reduced; lanes 1, 2) or
presence (reduced; lanes 3, 4) of 2-mercaptoethanol (5% v/v).
The samples were resolved on a 12% SDS-gel and revealed by
immunoprotein blotting with the CPG2 specific antiserum. The
position of CPH6 is indicated, as are the positions of
migration of dimeric (dimer-V115CPH6) and monomeric
(monomer-V115CPH6) V115CPH6. The positions of standard
proteins (x 10^-3) are indicated to the right of the figure.

Figure 6. Schematic representation and DNA and protein
sequence of KDR(BD)

A. Schematic representation of the KDR(BD).

The protein structures of KDR and KDR(BD) proteins are
represented schematically. The proteins are in the
conventional N-terminal to C-terminal orientation as
indicated (N and C respectively). The top figure represents
the complete KDR protein, where the individual regions are
differently shaded. The regions represented are the signal
peptide (SP), the extracellular domain (ECD), the
transmembrane region (TM) and the kinase domain (KD). Also
indicated are codons 809 and 810, where the EcoRI site in the
gene resides; this was used to truncate the protein to make
KDR(BD) as shown in the lower schematic. The 9E10 epitope
which was added to the C-terminus of the truncated protein is
represented as a hatched box.

Figure 7: Expression of KDR(BD) in Sf9 cells

Sf9 cells were infected with viruses expressing KDR(BD), or
as a control, a virus with a defective polyhedrin gene which
does not encode any known protein (empty). For each sample,
10^7 cells were extracted in lysis buffer (20mM Tris.HCl,
0.5mM EDTA, 10% v/v glycerol, 0.3M KCl, 1% v/v Triton X-100,
5µg/ml Leupeptin, 5µg/ml Pepstatin A, 50µg/ml
phenylmethylsulphonyl fluoride, 1mM benzamidine, pH8) 60
hours post infection. The protein content of the extracts
was determined using the Bio-Rad protein determination kit.
For each extract (KDR(BD), lane 1; empty, lane 2), 4.5µg of
protein was loaded onto a 7% SDS-gel and the proteins therein
detected by immunoprotein blotting, using the 9E10 mouse
monoclonal antibody. The position of migration of the
KDR(BD) is indicated by the arrow, and the positions of
standard proteins (x 10^-3) are indicated to the right of the
figure.

Figure 8: Binding of fusion proteins to KDR(BD)

The ability of the fusion proteins to bind to the KDR(BD) was
tested in vitro. The KDR(BD), expressed in Sf9 cells, was
immunoprecipitated with the 9E10 monoclonal antibody and then
the ability of purified fusion protein samples to bind to the
9E10/KDR(BD) immunocomplex was determined by measuring the
CPG2 enzyme activity. For each sample, the KDR(BD) from
750µg of insect cell extract, prepared as in Fig 7, was
immunoprecipitated with ~50µg of 9E10 monoclonal antibody
immobilised on protein G-Sepharose beads (+ KDR(BD); lanes
6-10). As a control, extracts from cells infected with the
control virus (- KDR(BD); lanes 1-5) were used instead of
extracts from cells infected with the KDR(BD) expressing
virus. The VEGF/CPG2 fusion proteins were also expressed in
Sf9 cells and prepared as Ni++-NTA agarose purified proteins,
as described in Table 5. The fusion proteins were balanced
against each other by CPG2 enzyme activity and for each
fusion, the amount of protein representing 4.8mU of CPG2
activity was added to the 9E10/KDR(BD) immunocomplexes. The
samples were incubated for 2 hours at 4°C and washed twice in
wash buffer 500 (50mM Tris.HCl, 500mM NaCl, pH8) and then
once in wash buffer 100 (50mM Tris.HCl, 500mM NaCl, pH8). The
amount of CPG2 activity retained by the immunocomplexes was
determined as described in Table 4.

Figure 9: Non-specific cell cytotoxicity directed by the
fusion proteins

A. Sensitivity of NIH3T3 cells to the CMDA prodrug.

Confluent NIH3T3 cells were cultured in the absence (control,
lane 1) or presence of 1mM CMDA prodrug (+CMDA, lane 2) for
18 hours. After incubation, the cells were diluted into
fresh dishes (at a dilution of 100 fold) and allowed to grow
for a further 6 days, after which time the survival of the
cells was determined by [3H]thymidine incorporation. The
results are expressed as % growth relative to the control,
which was considered to represent 100% cell growth.
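
The normalisation used for these growth figures can be
sketched as follows. This is a minimal illustration only: the
function name and the counts per minute are hypothetical and
are not taken from the experiments described here.

```python
def percent_growth(sample_cpm, control_cpm):
    """Express [3H]thymidine incorporation as % of the untreated control."""
    if control_cpm <= 0:
        raise ValueError("control counts must be positive")
    return 100.0 * sample_cpm / control_cpm


# Hypothetical counts per minute, chosen only for illustration
control_cpm = 20000.0
treated_cpm = 15000.0

print(percent_growth(control_cpm, control_cpm))  # the control is 100% by definition
print(percent_growth(treated_cpm, control_cpm))
```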

B. Sensitivity of NIH3T3 cells to the CMDA prodrug in the 
presence of the fusion proteins. 

Confluent NIH3T3 cells were cultured in the presence of 1mM
CMDA prodrug in the absence of any additions (control, lane
3) or in the presence of the indicated fusion proteins (lanes
4-8). The additions were: lane 4, CPH6; lane 5, CPV161H6;
lane 6, V161CPH6; lane 7, V115CPH6; lane 8, pEFCPV109H6. For
each fusion, 0.3 to 0.9 nM of the fusion was added as
determined by quantitative immunoprotein blotting. The cells
were incubated and processed as in section A above for cell
growth. The results are expressed as % growth relative to
the control lane, which was considered to represent 100%
cell growth.

Figure 10: VEGF dependent cell cytotoxicity directed by the
fusion protein V115CPH6



A. Sensitivity of Hu-V-ec cells to the CMDA prodrug. 

Confluent Hu-V-ec cells were cultured in the absence
(control, lane 1) or presence of 1mM CMDA prodrug (+CMDA,
lane 2) for 18 hours. After incubation, the cells were
diluted into fresh dishes (at a dilution of 3 fold) and
allowed to grow for a further 6 days, after which time the
survival of the cells was determined by [3H]thymidine
incorporation. The results are expressed as % growth
relative to the control, which was considered to represent
100% cell growth.

B. Sensitivity of Hu-V-ec cells to the CMDA prodrug in the 
presence of the fusion proteins. 

Confluent Hu-V-ec cells were cultured in the absence of any
additions (control, lane 3) or in the presence of CPH6 (27
nM, lane 4), or of V115CPH6 (11nM, lane 5) for 30 min. The
cells were then washed 5 times with fresh medium and
incubated overnight in the presence of 1mM CMDA. The cells
were re-plated at a dilution of 1/3 and allowed to grow for
a further 4 days, after which time the cell growth was
estimated by [3H]thymidine incorporation. The results are
expressed as % growth relative to the control lane, which
was considered to represent 100% cell growth.

TABLE 1: OLIGONUCLEOTIDES USED IN MUTATION GENERATION

Oligo. 1    GCTGCACCTATGGCAGAAGG
Oligo. 2    CCTTCTGCCATAGGTGCAGC
Oligo. 3    CGGCCATGGACTTTCTGCTGTCTTGGGTGC
Oligo. 4    GGCGGATCCGTCACATCTGCAAGTACGTTC
Oligo. 5    GGCGGATCCATTTTCTTGTCTTGCTCTATCTTTC
Oligo. 6    AATTCGGAGGTGGCGGAGGTACCTCTGGAGGCGGTCCAGGAGGTGGCGGGTC
Oligo. 7    CATGGACCCGCCACCTCCTGGACCGCCTCCAGAGGATACCTCCGCCACCTCCG
Oligo. 8    GGAAGCTTGGTACCGCACCCATGGCAGAAGG
Oligo. 9    GGTTCGAATCTAGACCCGGCTCACCGCCTCGG
Oligo. 10   AATTCCATCATCACCACCATCACGCTTCCTAGT
Oligo. 11   CTAGACTAGGAAGCGTGATGGTGGTGATGATGG
Oligo. 12   GGCGGATCCATCTTTCTTTGGTCTGCATTC
Oligo. 13   CGGATCACGGCCATGGAGAGCAAGGTGCTGCTG
Oligo. 14   AGCAATAAATGGAGATCTGTAATCTTG
Oligo. 15   CGATGAGCAGAAGCTGATATCCGAGGAGGACCTGAA
Oligo. 16   CTAGTTCAGGTCCTCCTCGGATATCAGCTTCTGCTCAT



The oligonucleotides used are presented in the conventional
5'-3' orientation.

TABLE 2: ANALYSIS OF CPG2 ACTIVITY IN CONDITIONED CULTURE
MEDIUM FROM THE TRANSFECTED COS CELLS

FUSION PROTEIN EXPRESSED    CPG2 ACTIVITY (U/ml)
Control
pEFCPH6                     0.096
pEFV161CPH6                 0.011
pEFV115CPH6                 0.045
pEFCPV165                   0.004
pEFCPV109                   0.17
pEFVEGF165                  0



The tissue culture medium from COS cells transfected with the
indicated vector was harvested and examined for CPG2
activity. Transfections were performed by standard techniques
using the LipofectAMINE reagent (Marais et al 1995). For the
CPG2 enzyme assay, 100µL of culture medium was incubated in
CPG2 buffer (100 mM Tris.HCl, 260µM ZnCl2, 50µM MTX, pH 7.3)
at 25°C and the rate of change in absorbance was monitored at
320nm. From this, the CPG2 activity in terms of units CPG2
per ml of culture medium (U/ml) was calculated.

TABLE 3: PREDICTED AND OBSERVED SIZES OF THE EXPRESSED
PROTEINS

Protein          No. of aa   Mr (Predicted)   Apparent Mr (Observed)*
CP(Q)3-H6        411         43731            45000
VEGF165          165         19152            N.D.
V161CP(Q)3-H6    566         61710            60-62000
V115CP(Q)3-H6    520         56386            54-58000
CP(Q)3V165       574         62272            58-60000
CP(Q)3V109       518         55657            52-55000
CP(Q)3V161-H6    578         62660            60-62000
CP(Q)3V109-H6    530         57135            59-61000

The table represents the number of amino acids predicted in
each mature fusion protein and from that the predicted
molecular masses calculated; these masses therefore take into
account the loss of the signal peptides. The observed masses
are determined from the SDS-PAGE analysis shown in Figure 2A.

N.D. - not determined

* The observed sizes are calculated from the predominant
band seen in each case in Figure 2A.

TABLE 4: EXPRESSION OF THE FUSION PROTEINS IN Sf9
CELLS

Fusion protein expressed    U/ml
Empty
CPH6                        0.4
V161CPH6                    0.06
V115CPH6                    0.2
CPV161H6                    0.01
CPV109H6                    0.04

The tissue culture medium from Sf9 cells infected with
viruses encoding the indicated proteins was harvested and
examined for CPG2 activity. For this experiment, 10^7 insect
cells were infected at a multiplicity of infection of ~4-10.
The cells were incubated with 10 mL of tissue culture medium
for 60 hours. The medium was harvested and the cells removed
by centrifugation. For the MTX assays, samples of medium were
incubated in CPG2 buffer (100 mM Tris.HCl, 260µM ZnCl2,
50µM MTX, pH 7.3) at 25°C and the rate of change in
absorbance was monitored at 320 nm. From this, the CPG2
activity in terms of units CPG2 per ml of culture medium
(U/ml) was calculated.
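
The conversion from a rate of absorbance change to U/ml is
not spelled out above. One way it might be done is sketched
below using the Beer-Lambert law. The extinction coefficient
for MTX at 320 nm (8300 per M per cm), the 1 ml reaction
volume, the 1 cm path length and the definition of one unit
as 1 micromole of MTX hydrolysed per minute are all
assumptions made for this illustration; none of them is
stated in this document, and the absorbance reading is
hypothetical.

```python
def cpg2_units_per_ml(delta_a_per_min, sample_volume_ml, reaction_volume_ml=1.0,
                      extinction_per_molar_cm=8300.0, path_length_cm=1.0):
    """Convert a rate of absorbance change at 320 nm into U per ml of sample.

    One unit is taken here (as an assumption) to be 1 micromole of MTX
    hydrolysed per minute.
    """
    # Beer-Lambert law: rate of concentration change in mol/L/min
    molar_rate = abs(delta_a_per_min) / (extinction_per_molar_cm * path_length_cm)
    # Micromoles hydrolysed per minute in the whole reaction volume
    umol_per_min = molar_rate * (reaction_volume_ml / 1000.0) * 1e6
    # Normalise to the volume of culture medium that was assayed
    return umol_per_min / sample_volume_ml


# Hypothetical reading: absorbance falls by 0.0083 per minute with 100 µL of medium
activity = cpg2_units_per_ml(-0.0083, sample_volume_ml=0.1)
print(round(activity, 4))
```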




TABLE 5: PROTEIN PURIFICATION BY Ni++-NTA AFFINITY
CHROMATOGRAPHY

                 CPH6                      V115CPH6
Fraction         % TOTAL     % CPG2        % TOTAL     % CPG2
                 PROTEIN     ACTIVITY*     PROTEIN     ACTIVITY*
LOAD             100         100           100         100
FLOW THROUGH     94          0             94          0
WASHES           2           0             2           0
ELUATE           4           99            4           100

The table shows the purification of CPH6 and V115CPH6 by
Ni++-NTA affinity chromatography. For each protein, 10^7
insect cells were infected at a multiplicity of infection of
4-10. The cells were incubated with 10 mL of tissue culture
medium for 60 hours and the medium was harvested, removing
the cells by centrifugation. The conditioned medium was
extensively dialysed against dialysis buffer (50mM Tris.HCl,
500 mM NaCl, pH8) and then loaded onto a 0.5 ml column in
column buffer (50 mM Tris.HCl, 500 mM NaCl, 10% glycerol,
0.5% v/v NP40, 10 mM imidazole, 5µg/ml Leupeptin, 5µg/ml
Pepstatin A, 50µg/ml phenylmethylsulphonyl fluoride, 1mM
benzamidine, pH8). The column was washed with 5ml of column
buffer containing 20 mM imidazole and the bound proteins were
eluted in column buffer containing 90 mM imidazole. The table
shows the proportion of total protein and CPG2 activity
located in each of the column fractions.



* The CPG2 activity was determined by enzyme assay, using
MTX as a substrate as described in Table 4.

TABLE 6: KINETIC ANALYSIS OF PURIFIED PROTEINS.

CONSTRUCT    Km (µM)
CPH6         19.6 ± 3.2
V161CPH6     18.2 ± 1.5
V115CPH6     17.0 ± 1.0
CPV109H6     9.1 ± 1.1



Sf9 insect cells were infected with viruses expressing CPH6,
V161CPH6, V115CPH6 and CPV109H6. The culture medium was
harvested and the fusion proteins purified by Ni++-NTA
agarose affinity chromatography as described in Table 5. The
purified proteins were tested for ability to cleave MTX as
described in Table 4 and standard regression analysis was
used to determine the affinity (Km) of each fusion for the
substrate.

4. Characterisation of the Fusion Proteins

The first experiments were directed at determining whether
the fusion proteins we had generated could be expressed in
eukaryotic cells. We used a transient transfection system
based on COS cells for this, since these cells are convenient
to culture and transfect using the LipofectAMINE reagent.
The genes for V161CPH6, V115CPH6, CPV165, CPV109 and VEGF165
were cloned into the mammalian expression vector pEFPlink.2.
These vectors are referred to as pEFV161CPH6, pEFV115CPH6,
pEFCPV165, pEFCPV109 and pEFVEGF165 respectively. COS cells
were transfected with these plasmids or with pEFPlink.2.
Since all of the proteins expressed by these clones would be
expected to be secreted, the conditioned medium was collected
from the transfected cells 72 hours after transfection and
examined for the presence of the fusion proteins as outlined
below.

4.1 CPG2 activity of fusion proteins

First, the conditioned medium from the transfected cells was
examined for the presence of CPG2 enzyme activity. Samples
of culture medium were analysed for their ability to degrade
the CPG2 substrate MTX. The results show that in cells
transfected with pEFPlink.2, there is no detectable CPG2
activity whereas the medium from the cells transfected with
pEFCPH6 accumulated CPG2 activity to a level of 0.096 units
of CPG2 per mL (U/mL) of medium (Table 2). The medium from
the cells which were transfected with each of the fusion
proteins was also found to accumulate CPG2 activity, with a
range of 0.004 to 0.14 U/ml culture medium. Thus it can be
seen that all four fusion genes express CPG2 in a form that
is active, is secreted from the cells and accumulates in the
culture medium. The possibility that this accumulation of
CPG2 activity is due to the expression of VEGF in these cells
can be discounted because the medium from the cells
transfected with pEFVEGF165 does not accumulate CPG2
activity (Table 2).

4.2 SDS-analysis of fusion proteins

Although the results shown in section 4.1 indicate that CPG2
activity accumulates in the tissue culture medium, they do
not show the presence of intact fusion proteins. In order to
determine whether the secreted fusion proteins were subjected
to proteolytic degradation, we examined them by sodium
dodecyl sulphate-polyacrylamide gel electrophoresis
(SDS-PAGE) and immunoprotein blotting. Samples of conditioned
medium from each of the COS transfections were subjected to
immunoprecipitation with a CPG2 specific antiserum. The
immunoprecipitated proteins were resolved by SDS-PAGE and the
CPG2 proteins revealed by immunoprotein blotting with the
CPG2 specific antiserum.

There were no proteins recognised by the CPG2 specific
antiserum in the conditioned medium from the cells
transfected with pEFPlink.2 (Fig 2, lane 7). A protein with
a Mr of ~45,000 was recognised by the CPG2 antiserum in the
medium of cells transfected with pEFCPH6 (Fig 2, lane 1).
The size of this protein is similar to the mass predicted for
CPH6 (Table 3). The culture medium from the cells
transfected with pEFV161CPH6, pEFV115CPH6, pEFCPV165 and
pEFCPV109 all accumulated proteins which were recognised by
the CPG2 specific antibody and were all larger in mass
than CPH6 (Figure 2, lanes 1, 3, 4, 5, 6). Each of the
fusions exists as multiple bands on SDS-PAGE and this was
attributed to variable glycosylation on the VEGF moiety.
When the apparent molecular masses of the predominant bands
were calculated from the SDS-gel, they were found to be close
to the predicted masses expected for each fusion, when the
size of the signal peptides is taken into account (Table 3).
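
How close the observed masses are to the predicted ones can
be checked with a short calculation on the Table 3 values. In
this sketch each observed range is reduced to its mid-point,
and a range written as "60-62000" is read as 60000 to 62000;
both choices are assumptions made for illustration. On that
reading, every mid-point lies within about 6% of the
predicted mass.

```python
# (predicted Mr, observed Mr low, observed Mr high) copied from Table 3
table3 = {
    "CP(Q)3-H6": (43731, 45000, 45000),
    "V161CP(Q)3-H6": (61710, 60000, 62000),
    "V115CP(Q)3-H6": (56386, 54000, 58000),
    "CP(Q)3V165": (62272, 58000, 60000),
    "CP(Q)3V109": (55657, 52000, 55000),
    "CP(Q)3V161-H6": (62660, 60000, 62000),
    "CP(Q)3V109-H6": (57135, 59000, 61000),
}


def percent_deviation(predicted, low, high):
    """Deviation of the observed mid-point from the predicted mass, in %."""
    midpoint = (low + high) / 2.0
    return 100.0 * (midpoint - predicted) / predicted


deviations = {name: percent_deviation(*row) for name, row in table3.items()}
for name, deviation in deviations.items():
    print(f"{name:15s} {deviation:+.1f}%")
```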

These data indicate that the fusion genes produce the
expected proteins. The proteins are of the correct sizes and
there is no significant accumulation of free CPG2 in any of
the media from the cells transfected with expression vectors
encoding the fusion proteins. This indicates that the
fusions are stable and not subjected to proteolytic
degradation in the tissue culture medium.

4.3 Heparin-binding of the fusion proteins

The fusion proteins were tested for their abilities to bind
to heparin with Heparin-Sepharose CL-6B (Pharmacia), which is
a column matrix consisting of Sepharose beads to which
heparin is covalently bound. The conditioned medium from the
transfected cells was incubated with Heparin-Sepharose CL-6B
and the proteins which were found to bind to the matrix were
identified by immunoprotein blotting.

CPH6 is shown by this analysis to be unable to bind to
heparin (Fig 2B, lane 8). The two fusion proteins which
contain the heparin binding domain of VEGF (V161CPH6 and
CPV165) bound to heparin with high efficiency (Fig 2B, lanes
10 and 13). In fact, when these results were quantitated it
was seen that even under these crude conditions, more than
60% of V161CPH6 and CPV165 were able to bind to the
heparin-matrix. Weak binding of V115CPH6 to the
heparin-matrix was observed (Fig 2B, lane 11) but when this
was quantitated, it was shown to represent less than 7% of
the fusion bound to the matrix. Furthermore, it should be
noted that the amount of V115CPH6 that was used in this
analysis was far greater than that of either V161CPH6 or
CPV165 and so the figure of 7% bound probably represents an
over-estimation of the true binding relative to that seen
with V161CPH6 or CPV165. CPV109 was also found to bind to
the heparin-matrix (Fig 2B, lane 12), but when quantitated
this was found to be only ~1.3% bound and deemed not to be
significant. These results show that, as predicted, when the
VEGF heparin-binding domain is present in the fusion
proteins, it enables them to bind to the heparin-matrix,
indicating that in the context of either a N- or a C-terminal
CPG2 fusion, the heparin-binding domain of VEGF functions
autonomously. When this region is excluded from the fusions,
then the resulting proteins show only very weak heparin
binding activity.

4.4 Purification of the fusion proteins by Heparin-Sepharose
affinity chromatography

Having shown that at least two of the fusions could bind to
Heparin-Sepharose beads with high efficiency, this technique
was used to purify the fusion proteins. Conditioned medium
from COS cells transfected with the plasmid pEFCPV165 was
passed over a Heparin-Sepharose column. The column was
washed and the bound proteins were eluted with buffer
containing increasing concentrations of NaCl. The elution of
the fusion protein was detected by assaying the fractions for
CPG2 activity.

When COS-conditioned medium containing CPV165 is processed
over a Heparin-Sepharose column, >96% of the protein loaded
does not bind. However, all of the fusion protein bound to
the column, as determined by CPG2 enzyme assay. When the
column was washed with buffer containing 400mM NaCl, 50% of
the bound protein (2% of the protein in the load sample)
eluted, and this fraction did not contain any of the CPG2
activity (Fig 3). When buffer containing 800mM NaCl was
applied to the column, the majority of the protein remaining
(~2% of the protein loaded) eluted from the column. This
fraction was found to contain the CPG2 activity and at
higher NaCl concentrations, no more CPG2 activity was eluted
(Fig 3).

The apparent yield of CPG2 enzyme activity eluted from the
column is ~140%. This suggests that the activity of the
purified protein is greater than that of the unpurified
protein, possibly due to the presence of an inhibitory
activity in the conditioned culture medium. This makes it
impossible to assess the purification fold of the sample
exactly; however, the 800mM NaCl fraction contains all of the
CPG2 loaded and eluted from the column, but only ~2% of the
protein loaded. Thus, the Heparin-Sepharose column gives a
>50 fold purification, demonstrating the feasibility of this
approach.

4.5 Expression of the fusion proteins in Sf9 insect cells

The bulk production of the fusion proteins was attempted in
order to allow them to be further characterised. The insect
cell system was chosen for this analysis because this system
can produce high levels of foreign proteins when the cells
are infected with recombinant baculoviruses. Insect cells
usually allow correct formation of structures such as
disulphide bonds and often perform post-translational
modifications such as proteolytic cleavages and
glycosylation, which occur in mammalian cells but are not
seen in bacterial expression systems. Before progressing to
make the insect cell viruses, however, polyhistidine tags
were incorporated onto the C-termini of CPV165 and CPV109 to
allow these proteins to be purified by Ni++-NTA-agarose
affinity column chromatography.

4.5.1 Cloning the baculovirus vectors

A polyhistidine tag was inserted on the 3' ends of the genes
for CPV165 and CPV109.

CPV165H6

The cloning of CPV165H6 was performed by using
oligonucleotides 4 and 8 in a PCR reaction with pVEGF165. The
PCR product was digested with the restriction endonucleases
KpnI and BamHI and this fragment was cloned back into
pEFCPV165 but in conjunction with oligonucleotides 10 and 11
to generate the polyhistidine tag at the 3' end of the gene.
This cloning strategy results in the addition of the
polyhistidine tag to the 3' end of the VEGF moiety of the
fusion gene and therefore to the C-terminus of the protein.
In constructing this gene, the terminal four codons of the
VEGF gene were removed. This construct is referred to as
CPV161H6 and the protein is represented schematically in Fig
1G. The sequence of the CPV161H6 gene and the protein that it
is predicted to encode is shown in SEQ ID No. 16.

The cloning of CPV109H6 was performed by using
oligonucleotides 8 and 12 in a PCR reaction with pVEGF165.
The PCR product was digested with the restriction
endonucleases KpnI and BamHI and this fragment was then
cloned back into pEFCPV165 but in conjunction with
oligonucleotides 10 and 11 to generate the polyhistidine tag
at the 3' end of the gene. This cloning strategy results in
truncation of the VEGF gene at codon 110 and the addition of
the polyhistidine tag at that position. Thus the residues
required for heparin-binding are removed. This construct is
referred to as CPV109H6 and the protein is represented
schematically in Fig 1H. The sequence of the CPV109H6 gene
and the protein that it is predicted to encode is shown in
SEQ ID No. 18.
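
The role of oligonucleotides 10 and 11 in supplying the
polyhistidine tag can be illustrated from their sequences
alone. The sketch below checks that the two oligonucleotides
are complementary apart from a 4-base overhang at each 5'
end, and translates oligonucleotide 10. The reading frame
(starting after the first five bases) is an assumption made
for this illustration, chosen because it yields six
consecutive histidine codons followed by a stop codon; the
helper names are hypothetical.

```python
OLIGO_10 = "AATTCCATCATCACCACCATCACGCTTCCTAGT"
OLIGO_11 = "CTAGACTAGGAAGCGTGATGGTGGTGATGATGG"

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Standard genetic code, codons ordered T, C, A, G at each position
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def translate(seq):
    """Translate complete codons from the start of seq; '*' marks a stop."""
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# The annealed pair is double-stranded except for a 4-base overhang at each end
duplex_matches = reverse_complement(OLIGO_10)[:-4] == OLIGO_11[4:]
print(duplex_matches)

# Assumed frame: skip the first five bases of oligonucleotide 10
tag_peptide = translate(OLIGO_10[5:])
print(tag_peptide)
```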

The genes for CPH6, V161CPH6, V115CPH6, CPV165H6 and CPV109H6
were cloned into the insect cell shuttle vector pVLPlink.2.
These vectors were used to produce recombinant baculovirus
particles following standard protocols.

4.5.2 Expression of CPH6 and the fusion proteins in Sf9
cells

Sf9 insect cells were infected with recombinant viruses
directing expression of CPH6, V161CPH6, V115CPH6, CPV165H6 and
CPV109H6. Control infections were also performed using a
virus which has an empty polyhedrin locus (the 'empty'
virus). Since these cells are eukaryotic, all the proteins
were expected to be secreted into the medium in which the
cells were growing. We therefore examined the conditioned
medium from the infected cells for the presence of CPG2
enzyme activity using MTX as a substrate.

The conditioned medium from the cells infected with the empty
virus was unable to degrade MTX (Table 4), indicating the
absence of any CPG2 activity in infected insect cells. By
contrast, the medium from the cells which were infected with
the virus which expresses CPH6 accumulated CPG2 protein to a
level of ~0.4 units of CPG2 per mL (U/mL) of tissue culture
medium (Table 4). The medium from the cells infected with
viruses encoding each of the other fusion proteins also
accumulated CPG2 activity to between 0.01 and 0.2 U/mL
(Table 4), indicating the secretion of the fusion proteins.

4.6 Ni++-Agarose affinity chromatography of the fusion
proteins

We used the insect cell produced material to determine
whether the polyhistidine tags could be used to purify the
expressed proteins. For this analysis, we used CPH6 and
V115CPH6. Sf9 cells were infected with the appropriate
viruses and the proteins secreted into the conditioned tissue
culture medium were subjected to a purification protocol
using Ni++-NTA-agarose (Qiagen: used according to
manufacturer's instructions). The conditioned media were
dialysed to remove histidine present in the medium (which may
interfere with binding of H6-proteins because histidine has
an imidazole ring) and then passed over the column. The
column was washed and the bound proteins were eluted with
imidazole. Only 6% of the protein in the conditioned medium
bound to the Ni++-NTA column, and ~33% of the bound protein
was eluted from the column in the wash cycles (Table 5).

When the flow through and wash fractions were assayed for
CPG2 activity, none could be detected in these fractions. In
the case of both CPH6 and V115CPH6, the amount of protein that
eluted in the 90mM imidazole wash represented ~4% of the
protein loaded onto the column (Table 5). When this was
assayed, it was found to contain ~99% of the CPG2 activity
loaded onto the column in the case of CPH6 and ~100% of the
CPG2 activity loaded onto the column in the case of V115CPH6.
Thus, the recovery from these columns is excellent.

In order to determine the purity of the eluted protein, the
samples were subjected to SDS-PAGE and silver staining. The
results show that both the preparations of CPH6 and V115CPH6
were extremely pure, representing >95% of the proteins in the
purified samples (Fig 4). This demonstrates the feasibility
of using this approach to purify the polyhistidine tagged
proteins and also underscores the fact that the insect cells
can produce large amounts of the fusion proteins, which are
secreted and accumulate to represent ~4% of the protein
present in the medium; hence the fold purification
required to prepare pure samples is only ~25.
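
The figure of ~25 follows from the ELUATE row of Table 5: the
eluate holds essentially all of the CPG2 activity in ~4% of
the protein. A minimal sketch of that arithmetic is given
below; the function name is hypothetical and fold
purification is taken here to mean the ratio of activity
recovered to protein recovered, which is an assumption about
how the figure was derived.

```python
def fold_purification(percent_activity_recovered, percent_protein_recovered):
    """Enrichment of activity per unit protein relative to the load."""
    if percent_protein_recovered <= 0:
        raise ValueError("protein recovered must be positive")
    return percent_activity_recovered / percent_protein_recovered


# ELUATE rows of Table 5: (% total protein, % CPG2 activity)
eluates = {"CPH6": (4, 99), "V115CPH6": (4, 100)}

for name, (protein, activity) in eluates.items():
    print(name, fold_purification(activity, protein))
```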

Using this protocol, CPH6, V161CPH6, V115CPH6 and CPV109H6 were
all purified from Sf9 cells by Ni++-NTA-agarose affinity
chromatography as described above. These samples were used
to determine the affinity of the CPG2 moiety for the
substrate MTX. The results show that the Km of all of the
fusion proteins is similar to that seen for CPH6 (Table 6),
which is similar to the Km of ~10µM determined for
bacterially produced CPG2. This indicates that fusion of CPG2
to VEGF does not impair the affinity of CPG2 for its
substrate.
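
The text states only that standard regression analysis was
used to obtain Km. As an illustration of one common approach,
the sketch below applies a Hanes-Woolf linearisation ([S]/v
plotted against [S]) with an ordinary least-squares fit; this
particular method is an assumption and may not be the one
actually used. The rate data are synthetic and noise-free,
generated from an assumed Km of 19.6 µM and an arbitrary Vmax
of 1.0; they are not measurements.

```python
def hanes_woolf_fit(substrate_uM, rates):
    """Least-squares fit of [S]/v against [S]; returns (Km, Vmax).

    For Michaelis-Menten kinetics [S]/v = [S]/Vmax + Km/Vmax, so the
    slope is 1/Vmax and the intercept is Km/Vmax.
    """
    xs = list(substrate_uM)
    ys = [s / v for s, v in zip(substrate_uM, rates)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope
    km = intercept * vmax
    return km, vmax


# Synthetic, noise-free rates generated from assumed parameters (not measured data)
TRUE_KM, TRUE_VMAX = 19.6, 1.0
substrate = [5.0, 10.0, 20.0, 40.0, 80.0]
rates = [TRUE_VMAX * s / (TRUE_KM + s) for s in substrate]

km, vmax = hanes_woolf_fit(substrate, rates)
print(round(km, 2), round(vmax, 2))
```

With real, noisy data a direct non-linear fit of the
Michaelis-Menten equation is often preferred, since
linearised plots weight the experimental errors unevenly.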

4.7 Dimer analysis of fusion proteins

VEGF produced in mammalian cells is a dimeric protein which
is stabilised by inter-molecular and intra-molecular cysteine
bridges. CPG2 is also a dimeric protein, but these dimers
are stabilised by non-covalent interactions. In order to
determine whether the VEGF/CPG2 fusion proteins which were
produced in Sf9 cells are dimers stabilised by cysteine
bridges, a dimer analysis was performed. CPH6 and V115CPH6,
both produced in insect cells, were heated either in the
presence or absence of the reducing agent 2-mercaptoethanol.
The resultant proteins were resolved by SDS-PAGE and
subjected to immunoprotein blot analysis using the CPG2
specific antiserum. The results show that following heating,
either in the presence or absence of reducing agents, CPH6
migrates as a protein with a mass of ~45kDa (Fig 5, lanes 2,
4), showing that this protein does not exist in the form of
dimers stabilised by cysteine bridges. V115CPH6, by contrast,
is seen as a protein with a Mr of ~120 kDa in the absence of
reducing agents, but as a protein with a Mr of ~60 kDa in the
presence of reducing agents (Fig 5, lanes 1, 3). These data
show that V115CPH6 produced in Sf9 cells is a dimeric protein
which is stabilised by cysteine bridges, as in mammalian
cells.
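
The inference drawn from the reduced and non-reduced lanes
amounts to comparing two apparent masses. A minimal sketch,
using the approximate Mr values quoted above and a
hypothetical helper name, is:

```python
def subunits_per_complex(non_reduced_kda, reduced_kda):
    """Estimate how many reduced chains make up the non-reduced species."""
    return round(non_reduced_kda / reduced_kda)


# Approximate Mr values quoted in the text for Figure 5
v115cph6_subunits = subunits_per_complex(120, 60)
cph6_subunits = subunits_per_complex(45, 45)

print("V115CPH6:", v115cph6_subunits)
print("CPH6:", cph6_subunits)
```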

4.8 Binding of the fusion proteins to the VEGF receptor

We used the insect cell expressed protein to determine
whether the fusion proteins could bind to the VEGF receptor
by biochemical methods. For this analysis, we developed an
in vitro assay in which the ability of the fusion proteins to
bind to the external domain of the VEGF receptor, KDR, was
measured. Receptor tyrosine kinases, of which KDR is a
member, span the plasma membrane and consist of three
domains. The external domain, which is usually highly
glycosylated, contains the growth factor binding domain and
thus is the site of specific interaction with the ligand.
There is then a short transmembrane domain, made of a single
amino acid chain, and the C-terminus, which is located in the
cytoplasm, consists of the tyrosine kinase domain. In the
assay we developed, we wished to use the extracellular domain
(ECD) as a specific probe for VEGF binding. In order to
achieve this, we cloned the KDR external domain and
transmembrane domain for expression in Sf9 cells. The
internal domain was removed in this cloning and a tag for the
monoclonal antibody 9E10 was added to the 3' end of the
cloned fragment; this construct is referred to as the KDR
binding domain (BD) (See Fig 6A). The rationale for this
approach was to express the KDR(BD) in Sf9 cells and prepare
extracts from the infected cells. The KDR(BD) could then be
immunoprecipitated with the 9E10 monoclonal antibody and used
as a probe to detect binding of the VEGF/CPG2 fusions to the
VEGF receptor.



4.8.1. Cloning of KDR(BD)

For this approach, the KDR gene in the plasmid
pBK-CMVJE/KDRsense was used. PCR, using oligonucleotides 13
and 14 in conjunction with the KDR gene, was used to create
an Nco I restriction endonuclease site at the 5' end of the
gene. The KDR(BD) was then cloned as an Nco I/Eco RI fragment
into the baculovirus vector pVLPlink.2. The Eco RI site in
KDR is located at codons 809 and 810 of KDR, which is just on
the 3' side of the sequence encoding the transmembrane domain
(see Fig 6A). A tag for the 9E10 monoclonal antibody was
cloned into the Eco RI site using standard cloning techniques
with oligonucleotides 15 and 16. The sequence of this KDR
fragment, referred to as KDR(BD), is shown in SEQ ID No. 20
and it is represented schematically in Fig 6A.

4.8.2. Expression of KDR(BD) in Sf9 cells

The expression of the KDR(BD) in insect cells was verified by
immunoprotein blotting of infected insect cells using the
9E10 monoclonal antibody. The results show that in cells
infected with the KDR(BD) virus there is a band with a Mr of
~120 kDa which is recognised by the 9E10 monoclonal antibody
and which is not present in extracts from cells infected with
the empty virus (Fig 7). The size of this band is greater
than the size expected for the cloned fragment (expected size
= 92,769 Da) and this is probably due to glycosylation of
KDR(BD). The data show that KDR(BD) can be expressed in Sf9
cells.
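The size discrepancy noted above can be put in numbers. The short Python sketch below compares the approximate apparent mass on the blot with the predicted polypeptide mass; the constant and function names are hypothetical, and the ~120 kDa figure is only the rough gel estimate quoted in the text.

```python
EXPECTED_DA = 92_769   # predicted mass of KDR(BD) given in the text
OBSERVED_DA = 120_000  # approximate apparent Mr on the blot (~120 kDa)


def mass_excess(observed_da, expected_da):
    """Return (excess in Da, excess as a fraction of the expected mass)."""
    excess = observed_da - expected_da
    return excess, excess / expected_da


excess_da, excess_fraction = mass_excess(OBSERVED_DA, EXPECTED_DA)
print(f"Excess over predicted mass: {excess_da} Da ({excess_fraction:.0%})")
```

An excess of this order, roughly 27 kDa, is what the text attributes to probable glycosylation of the receptor fragment.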

4.8.3. Binding of fusion proteins to the KDR(BD)

The abilities of the four VEGF/CPG2 fusion proteins to bind
to KDR(BD) were tested. For this assay, extracts from insect
cells that had been infected with either the KDR(BD)
expressing virus or a control (empty) virus were incubated
with the 9E10 monoclonal antibody immobilised on Protein
G-Sepharose beads, to precipitate the KDR(BD). The
immunocomplexes were washed and then incubated with CPH6,
V161CPH6, V115CPH6, CPV161H6 and CPV109H6 protein which had
been expressed in insect cells and purified by Ni++-NTA
affinity chromatography. The complexes were washed once more
and the presence of the fusion proteins was judged by
measuring CPG2 activity in the immunocomplexes.

CPH6 was found not to interact with the KDR(BD), since no
CPG2 activity is found in KDR(BD)/9E10 immunocomplexes (Fig
8, lane 6). However, V161CPH6, V115CPH6, CPV161H6 and CPV109H6
were all able to bind to the KDR(BD), as seen by the fact that
with each of these fusions, CPG2 activity could be detected
associated with the KDR(BD)/9E10 immunocomplex (Fig 8, lanes
7-10). That this interaction was specific could be shown by
the fact that when the 9E10 immunoprecipitations were
performed with extracts from Sf9 cells infected with empty
virus instead of the KDR(BD) expressing virus, no binding of
the fusion proteins could be detected (Fig 8, lanes 2-5).
These data establish that all four of the fusion proteins can
interact specifically with the ligand binding domain of KDR,
whereas CPH6 cannot.

5. CELL CYTOTOXICITY ASSAYS

Taken together, the above data establish that fusion proteins
can be constructed between CPG2 and VEGF in two orientations
and with VEGF moieties that do, or do not, contain the
heparin-binding domain of VEGF. These proteins are secreted
by eukaryotic cells and are stable in terms of VEGF functions
(receptor binding and, where appropriate, heparin-binding) and
in terms of CPG2 enzyme activity. In order to prevent
glycosylation of the CPG2 moiety, the sites of inappropriate
glycosylation on CPG2 were mutated. This results in a
protein with only ~10% enzyme activity compared to the wild
type protein, but its affinity for its substrate is
indistinguishable from bacterially expressed wild type CPG2
(see WO96/03515). The presence of a polyhistidine tag at the
C-terminus of the fusions allows the purification of the
fusions by Ni++-NTA Agarose chromatography and the fusions
which contain the heparin-binding domain of the VEGF moiety
can be purified on Heparin-Sepharose columns. We therefore
progressed to an analysis of the ability of the fusion
proteins to direct prodrug dependent cytotoxicity of
mammalian cells.

5.1 Non-specific cytotoxicity assay

For the first assay, we wished to establish that CPH6 and the
fusion proteins were able to direct prodrug dependent
cytotoxicity of NIH3T3 cells in a non-targeted manner. For
this experiment, CPH6, V161CPH6, V115CPH6, CPV161H6 and CPV109H6
were all purified by Ni++-NTA Agarose. Samples of the
purified proteins were added to NIH3T3 cells in the presence
of the CMDA prodrug and the effects on cell viability were
determined by measuring cell growth by [3H]-thymidine
incorporation.

Treatment of these cells with CMDA alone does not
significantly affect their growth (Fig 9, lanes
1, 2). When these cells are treated with sub-nanomolar
concentrations of CPH6 or each of the fusion proteins in the
presence of CMDA, complete cell death occurs (Fig 9, lanes
3-7). Thus, it can be seen that each of the fusion proteins is
highly efficient in directing the specific killing of
mammalian cells in the presence of the CMDA prodrug.

5.2 VEGF-directed cytotoxicity assay

The fusion proteins were assessed for their ability to direct
the CMDA-dependent killing of cells which express specific
receptors for VEGF on their surface. For this analysis, we
employed human umbilical vein endothelial (Hu-V-ec) cells,
which are dependent for growth on VEGF (Prog Growth Factor
Res, 5, 89, 1994). These cells are not significantly
sensitive to CMDA, as judged by the effects of 1 mM CMDA on
their growth rate (Fig 10A). The cells were incubated in the
presence of CPH6 or V115CPH6 for 30 min to allow the fusion
proteins to bind to the cell surface and then the cells were
washed to remove unbound protein. CMDA was added and cell
survival was determined by [3H]-thymidine incorporation.

When the analysis is performed with CPH6, there is no
significant effect on the growth rate of the cells (Fig 10B,
lanes 1, 2). By contrast, when the analysis is performed
with V115CPH6, there is a >90% inhibition of cell growth (Fig
10B, lanes 1, 3). These data show that CPH6 does not appear
to bind to the surface of Hu-V-ec cells and so is unable to
direct their killing in the presence of the prodrug, as it is
washed off the cells prior to the addition of the prodrug.
However, V115CPH6 remains bound to the cell surface, is not
washed off the cells and so is able to direct their killing
when the prodrug is added. This provides a proof of the
feasibility of the LIDEPT approach.
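Growth inhibition in a [3H]-thymidine incorporation assay is usually expressed relative to an untreated control. The Python sketch below shows one common way to compute such a percentage. The count values are hypothetical placeholders chosen only to illustrate the calculation; they are not data from Fig 10B, and the function name is likewise an assumption.

```python
def percent_inhibition(treated_cpm, control_cpm):
    """Percentage reduction in incorporated counts relative to the control."""
    if control_cpm <= 0:
        raise ValueError("control counts must be positive")
    return 100.0 * (1.0 - treated_cpm / control_cpm)


# Hypothetical counts per minute, for illustration only.
control_cpm = 20000.0   # cells + CMDA, no enzyme
cph6_cpm = 19000.0      # cells pre-incubated with CPH6, washed, then CMDA
v115cph6_cpm = 1500.0   # cells pre-incubated with V115CPH6, washed, then CMDA

cph6_inhibition = percent_inhibition(cph6_cpm, control_cpm)
v115_inhibition = percent_inhibition(v115cph6_cpm, control_cpm)

print(f"CPH6: {cph6_inhibition:.1f}% inhibition")
print(f"V115CPH6: {v115_inhibition:.1f}% inhibition")
```

Under these made-up numbers the untargeted enzyme gives an inhibition of a few percent, while the targeted fusion exceeds the >90% threshold reported in the text.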
SEQUENCE INFORMATION

In the sequences SEQ ID NOS. 1 to 20, the DNA sequences are 
presented in the conventional 5' -> 3' orientation and the 
protein sequences in the conventional N- -> C-terminal 
orientation as indicated. 

SEQ ID Nos. 3 and 4: DNA and Protein Sequences of CPG2

The DNA sequence is presented and below it, the predicted
protein sequence. Within the DNA sequence, the engineered
restriction endonuclease sites are shown in lower case. The
position of the Nco I site at the 5' end of the gene is also
indicated. Four mutations within the gene are indicated by
*. The mutation at *1 represents a silent mutation at the
Asn codon at position 172 of CPG2 which occurred during the
PCR cloning of this gene. The mutations at *2, *3 and *4 are
the Asn -> Gln mutations required to prevent glycosylation of
CPG2 in mammalian cells. Numbers below the protein sequence
indicate the codons of the genes from which the individual
fragments are derived. The region derived from c-erbB2 is
underlined and this contains codons 1 to 27 of that gene,
which encompasses the signal peptide used to direct this
protein into the secretory pathway. The region derived from
CPG2 contains codons 23 to 415. The polyhistidine tag at the
C-terminus of the clone is shown in italics within the
protein sequence and the stop codon is indicated by •.
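The Asn -> Gln mutations described above remove N-linked glycosylation sites. As a hedged illustration, the Python sketch below scans a protein fragment for the consensus N-X-S/T motif (X being any residue except proline) and replaces the Asn with Gln. The consensus rule is standard biochemistry rather than something stated in this text, the function names are hypothetical, and the nine-residue fragment is the stretch around one of the mutated sites as it appears in the CPG2 listing (Tyr-Val-Gln-Val-Asn-Ile-Thr-Gly-Lys).

```python
import re

# N-X-S/T sequon with X != Pro; the lookahead lets overlapping sites be found.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein):
    """Return 0-based positions of Asn residues in N-X-S/T motifs."""
    return [match.start() for match in SEQUON.finditer(protein)]


def asn_to_gln(protein, positions):
    """Return a copy of the sequence with Asn replaced by Gln at positions."""
    residues = list(protein)
    for pos in positions:
        if residues[pos] != "N":
            raise ValueError(f"residue {pos} is {residues[pos]}, not Asn")
        residues[pos] = "Q"
    return "".join(residues)


wild_type = "YVQVNITGK"  # one-letter code for the fragment named above
sites = find_sequons(wild_type)
mutant = asn_to_gln(wild_type, sites)

print("Sequon positions in wild type:", sites)
print("Mutant fragment:", mutant)
print("Sequon positions in mutant:", find_sequons(mutant))
```

The mutant fragment matches the Val-Gln-Ile-Thr stretch shown in the CPH6 protein listing and no longer contains the motif.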

SEQ ID Nos. 5 and 6: DNA and Protein Sequences of VEGF

The DNA sequence is presented and below it, the predicted
protein sequence. Within the protein sequence, the signal
peptide which is proteolytically cleaved from the mature
protein is underlined. The amino-acid numbering therefore
begins with residue -26 to indicate that fact, and residue
1 is the first amino acid in the mature protein. The position
of the single glycosylation site is indicated by *. The stop
codon is indicated by •.

In the fusion protein sequences the DNA sequence is presented
and below it, the predicted protein sequence. The VEGF
derived sequences are shown in bold type, and the CPG2
derived sequences in normal type. The two DNA sequences in
SEQ ID Nos. 7, 9, 11 and 13 are taken from SEQ ID Nos. 3 and 5.

SEQ ID Nos. 7 and 8: DNA and Protein Sequences of V161CPH6

A silent mutation within the VEGF sequence at codon 2
(CCC->CCT) of the mature protein was created to destroy the
Nco I site located in this gene (*). The engineered
restriction endonuclease sites are shown in lower case. The
numbers below the protein sequence indicate the codons for
the genes from which the individual fragments are derived.
The VEGF sequences are from -26 to 161 and the CPG2 sequences
from 23 to 415. The signal peptide from VEGF which is cleaved
proteolytically from the mature protein is underlined. The
polyhistidine tag is shown in italics within the protein
sequence and the stop codon is indicated by •.
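The effect of the silent CCC->CCT change can be checked directly on the first five codons of mature VEGF as they appear in the VEGF DNA listing (GCA CCC ATG GCA GAA). The Python sketch below, which uses the standard genetic code and hypothetical helper names, confirms that the Nco I recognition sequence (CCATGG) disappears while the translated peptide is unchanged.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

NCO_I_SITE = "CCATGG"


def translate(dna):
    """Translate an in-frame DNA string; any trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


original = "GCACCCATGGCAGAA"  # codons 1-5 of mature VEGF before mutation
mutated = "GCACCTATGGCAGAA"   # codon 2 changed from CCC to CCT

print("Nco I site before:", NCO_I_SITE in original)
print("Nco I site after: ", NCO_I_SITE in mutated)
print("Peptide before:", translate(original))
print("Peptide after: ", translate(mutated))
```

Both CCC and CCT encode proline, so the change removes the internal site without altering the protein, leaving the engineered Nco I site at the 5' end as the only one in the construct.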

SEQ ID Nos. 9 and 10: DNA and Protein Sequences of V115CPH6

A silent mutation within the VEGF sequence at codon 2
(CCC->CCT) of the mature protein was created to destroy the
Nco I site located in this gene (*). Additionally, the VEGF
gene has been truncated at codon 115 as indicated by the
numbering below the protein sequence. The engineered
restriction endonuclease sites are shown in lower case. The
numbers below the protein sequence indicate the codons for
the genes from which the individual fragments are derived.
The VEGF sequences are from -26 to 115 and the CPG2 sequences
from 23 to 415. This fusion therefore does not contain the
amino acids responsible for heparin-binding of VEGF. The
signal peptide from VEGF which is cleaved proteolytically
from the mature protein is underlined. The polyhistidine tag
is shown in italics within the protein sequence and the stop
codon is indicated by •.

SEQ ID Nos. 11 and 12: DNA and Protein Sequences of CPV165

A peptide spacer was generated by additional DNA sequences
between the CPG2 and VEGF sequences, encompassed by the Eco RI
and Kpn I sites. The spacer peptide which these encode is
indicated in italics and underlined, within the protein
sequence. The cloning of this construct resulted in the loss
of the polyhistidine tag from the 3' end of the CPG2 derived
sequences and also in the removal of the sequences for the
VEGF signal peptide. At the 5' end of the gene are the first
27 codons of c-erbB2, followed by CPG2 amino acids 23-415,
fused via the peptide spacer to VEGF amino acids 1-165. The
stop codon is indicated by

SEQ ID Nos. 13 and 14: DNA and Protein Sequences of CPV109

A peptide spacer was generated by additional DNA sequences
between the CPG2 and VEGF sequences, encompassed by the Eco RI
and Kpn I sites. The spacer peptide which these encode is
indicated in italics and underlined, within the protein
sequence. The cloning of this construct resulted in the loss
of the polyhistidine tag from the 3' end of the CPG2 derived
sequences and also in the removal of the sequences for the
VEGF signal peptide. This clone was created when a
PCR-generated mutation occurred during the creation of CPV165.
This converts the Arg codon 110 of mature VEGF into a stop
codon, resulting in truncation of the VEGF moiety at codon
109 as indicated in the protein sequence derived from this
clone (*). At the 5' end of the gene are the first 27 codons
of c-erbB2, followed by CPG2 amino acids 23-415, fused via
the peptide spacer to VEGF amino acids 1-109. The mutation
resulting in the stop codon is indicated by

SEQ ID Nos. 15 and 16: DNA and Protein Sequences of CPV161H6

The DNA sequences of VEGF and CPG2 are taken from SEQ ID No.
11. A peptide spacer was generated by additional DNA
sequences between the two, encompassed by the Eco RI and Kpn
I sites. The spacer peptide which these encode is indicated
in italics and underlined, within the protein sequence. In
this clone, a polyhistidine tag has been included in the gene
at the position 3' to codon 161 of mature VEGF by creating a
Bam HI site at codon 162 of mature VEGF as shown. The
polyhistidine tag was then cloned into this position and this
encodes a C-terminal protein tag as shown in italics at the
end of the protein sequence. During the cloning, for
convenience, an Eco RI site was created 3' to the Bam HI site
as indicated. At the 5' end of the gene are the first 27
codons of c-erbB2, followed by CPG2 amino acids 23-415, fused
via the peptide spacer to VEGF amino acids 1-161 and then the
polyhistidine tag. The stop codon is indicated by #.

SEQ ID Nos. 17 and 18: DNA and Protein Sequences of CPV109H6

The two DNA sequences of VEGF and CPG2 are taken from SEQ ID
No. 13. A peptide spacer was generated by additional DNA
sequences between the two, encompassed by the Eco RI and Kpn
I sites. The spacer peptide which these encode is indicated
in italics and underlined, within the protein sequence. In
this clone, a polyhistidine tag has been included in the gene
at the position 3' to codon 109 of mature VEGF by creating a
Bam HI site at codon 110 of mature VEGF as shown. The
polyhistidine tag was then cloned into this position and this
encodes a C-terminal protein tag as shown in italics at
the end of the protein sequence. During the cloning, for
convenience, an Eco RI site was created 3' to the Bam HI site
as indicated. At the 5' end of the gene are the first 27
codons of c-erbB2, followed by CPG2 amino acids 23-415, fused
via the peptide spacer to VEGF amino acids 1-109 and then the
polyhistidine tag. The stop codon is indicated by

SEQ ID Nos. 19 and 20: DNA and Protein Sequence of KDR(BD)

The position of the engineered Nco I site at the 5' end and
the endogenous Eco RI site at codons 809 and 810 are
indicated. The signal peptide is indicated in the protein
sequence as the underlined region. The transmembrane region,
located between codons 746 and 770 of the protein sequence,
is underlined and in italics. The DNA sequence used to encode
the C-terminal 9E10 tag is shown in lower case and the
protein sequence it encodes is in italics. This region
encodes amino acids 1-810 of KDR fused to the 9E10
epitope. The stop codon is indicated by

SEQUENCE INFORMATION



Sequences of 
CPG 2 



SEQ ID NO. 1: Genomic DNA 

SEQ ID NO. 2: Protein Sequence 

ATCATGGATC CACGCACTGA AGGCGCGCGG CAAGACGCGC GGCGTGGCGA CGCTGTGCAT

CGGCGGGGGC GAAGGCACCG CAGTGGCACT CGAATTGCTA TAAGAACCAT GGCTGGGGAC 

GCCCGACAAC AGGCGTCCAC CAGCTTTTTT CATTCCGACA ACCCGAACGA ACAATGCGTA 

GAGCAGGAGA TTCC ATG CGC CCA TCC ATC CAC CGC ACA GCC ATC GCC GCC 
Met Arg Pro Ser Ile His Arg Thr Ala Ile Ala Ala
1 5 10 

GTG CTG GCC ACC GCC TTC GTG GCG GGC ACC GCC CTG GCC CAG AAG CGC 
Val Leu Ala Thr Ala Phe Val Ala Gly Thr Ala Leu Ala Gln Lys Arg
15 20 25 

GAC AAC GTG CTG TTC CAG GCA GCT ACC GAC GAG CAG CCG GCC GTG ATC 
Asp Asn Val Leu Phe Gln Ala Ala Thr Asp Glu Gln Pro Ala Val Ile
30 35 40 

AAG ACG CTG GAG AAG CTG GTC AAC ATC GAG ACC GGC ACC GGT GAC GCC 
Lys Thr Leu Glu Lys Leu Val Asn Ile Glu Thr Gly Thr Gly Asp Ala
45 50 55 60 

GAG GGC ATC GCC GCT GCG GGC AAC TTC CTC GAG GCC GAG CTC AAG AAC 
Glu Gly Ile Ala Ala Ala Gly Asn Phe Leu Glu Ala Glu Leu Lys Asn
65 70 75 

CTC GGC TTC ACG GTC ACG CGA AGC AAG TCG GCC GGC CTG GTG GTG GGC 
Leu Gly Phe Thr Val Thr Arg Ser Lys Ser Ala Gly Leu Val Val Gly 
80 85 90 

GAC AAC ATC GTG GGC AAG ATC AAG GGC CGC GGC GGC AAG AAC CTG CTG 
Asp Asn Ile Val Gly Lys Ile Lys Gly Arg Gly Gly Lys Asn Leu Leu
95 100 105 

CTG ATG TCG CAC ATG GAC ACC GTC TAC CTC AAG GGC ATT CTC GCG AAG 
Leu Met Ser His Met Asp Thr Val Tyr Leu Lys Gly Ile Leu Ala Lys
110 115 120 

GCC CCG TTC CGC GTC GAA GGC GAC AAG GCC TAC GGC CCG GGC ATC GCC 
Ala Pro Phe Arg Val Glu Gly Asp Lys Ala Tyr Gly Pro Gly Ile Ala
125 130 135 140

GAC GAC AAG GGC GGC AAC GCG GTC ATC CTG CAC ACG CTC AAG CTG CTG 
Asp Asp Lys Gly Gly Asn Ala Val Ile Leu His Thr Leu Lys Leu Leu
145 150 155

AAG GAA TAC GGC GTG CGC GAC TAC GGC ACC ATC ACC GTG CTG TTC AAC 710
Lys Glu Tyr Gly Val Arg Asp Tyr Gly Thr Ile Thr Val Leu Phe Asn
160 165 170 

ACC GAC GAG GAA AAG GGT TCC TTC GGC TCG CGC GAC CTG ATC CAG GAA 758 
Thr Asp Glu Glu Lys Gly Ser Phe Gly Ser Arg Asp Leu Ile Gln Glu
175 180 185 

GAA GCC AAG CTG GCC GAC TAC GTG CTC TCC TTC GAG CCC ACC AGC GCA 806 
Glu Ala Lys Leu Ala Asp Tyr Val Leu Ser Phe Glu Pro Thr Ser Ala 
190 195 200 

GGC GAC GAA AAA CTC TCG CTG GGC ACC TCG GGC ATC GCC TAC GTG CAG 854 
Gly Asp Glu Lys Leu Ser Leu Gly Thr Ser Gly Ile Ala Tyr Val Gln
205 210 215 220 

GTC AAC ATC ACC GGC AAG GCC TCG CAT GCC GGC GCC GCG CCC GAG CTG 902 
Val Asn Ile Thr Gly Lys Ala Ser His Ala Gly Ala Ala Pro Glu Leu
225 230 235 

GGC GTG AAC GCG CTG GTC GAG GCT TCC GAC CTC GTG CTG CGC ACG ATG 950 
Gly Val Asn Ala Leu Val Glu Ala Ser Asp Leu Val Leu Arg Thr Met 
240 245 250 

AAC ATC GAC GAC AAG GCG AAG AAC CTG CGC TTC AAC TGG ACC ATC GCC 998 
Asn Ile Asp Asp Lys Ala Lys Asn Leu Arg Phe Asn Trp Thr Ile Ala
255 260 265 

AAG GCC GGC AAC GTC TCG AAC ATC ATC CCC GCC AGC GCC ACG CTG AAC 1046 
Lys Ala Gly Asn Val Ser Asn Ile Ile Pro Ala Ser Ala Thr Leu Asn
270 275 280 

GCC GAC GTG CGC TAC GCG CGC AAC GAG GAC TTC GAC GCC GCC ATG AAG 1094 
Ala Asp Val Arg Tyr Ala Arg Asn Glu Asp Phe Asp Ala Ala Met Lys 
285 290 295 300 

ACG CTG GAA GAG CGC GCG CAG CAG AAG AAG CTG CCC GAG GCC GAC GTG 1142 
Thr Leu Glu Glu Arg Ala Gln Gln Lys Lys Leu Pro Glu Ala Asp Val
305 310 315 

AAG GTG ATC GTC ACG CGC GGC CGC CCG GCC TTC AAT GCC GGC GAA GGC 1190 
Lys Val Ile Val Thr Arg Gly Arg Pro Ala Phe Asn Ala Gly Glu Gly
320 325 330 

GGC AAG AAG CTG GTC GAC AAG GCG GTG GCC TAC TAC AAG GAA GCC GGC 1238 
Gly Lys Lys Leu Val Asp Lys Ala Val Ala Tyr Tyr Lys Glu Ala Gly 
335 340 345 

GGC ACG CTG GGC GTG GAA GAG CGC ACC GGC GGC GGC ACC GAC GCG GCC 1286 
Gly Thr Leu Gly Val Glu Glu Arg Thr Gly Gly Gly Thr Asp Ala Ala 
350 355 360 

TAC GCC GCG CTC TCA GGC AAG CCA GTG ATC GAG AGC CTG GGC CTG CCG 1334
Tyr Ala Ala Leu Ser Gly Lys Pro Val Ile Glu Ser Leu Gly Leu Pro
365 370 375 380 

GGC TTC GGC TAC CAC AGC GAC AAG GCC GAG TAC GTG GAC ATC AGC GCG 1382 
Gly Phe Gly Tyr His Ser Asp Lys Ala Glu Tyr Val Asp Ile Ser Ala
385 390 395 

ATT CCG CGC CGC CTG TAC ATG GCT GCG CGC CTG ATC ATG GAT CTG GGC 1430 
Ile Pro Arg Arg Leu Tyr Met Ala Ala Arg Leu Ile Met Asp Leu Gly
400 405 410 

GCC GGC AAG T GAATGCTGCC CCCCGGCTTT TCACTCGCGT TGCTCGTGTA 1480 
Ala Gly Lys 
415 

ACTCCACCCC CCGAGGGGGA GGCGCGGTCC GCCTTGGGGC GGCCCGGCGG CGACCGCCTC 1540 

GTCACATAGA AGGAACTGCC ATGTTGTTGA CAGCAGACCA GGAAGCCATC CGCGACGCGG 1600

TGCGCGACTT CTCGCAAGCC GAACTCTGGC CCAACGCCGC GAATGGGGAC CGCGAGCACA 1660

GCTTTCCCAA GAGCCCACCA GGCCGTCGGC TGGCGTACGC AGTCTGCGTG CCCGAGGAGC 1720 

ATGGCGGCGC CGGCCTCGAC TACCTCACCT CGCGCTGGTG CTGGAGGAGA TCGCGGCCGG 1780 

CGACGGCGGC ACCAGCACCG CCATCAGCGT GACCAACTGC CCCGTCAACG CCATCCTCAT 1840

GCGCTACGGC AACGCGCAGC AGAAGAAGCA GTGGCTCGAG CCGCTGGCGC AGGGCCGGAT 1900 

GCTCGGCGCC TTCTGCCTGA CCGAGCCGCA GGCCGGCAGC GATGCATCGA GCCTGCGCAC 1960 

CACGGCGCGC AAGGACGGCG ACGGCTACGT GATCGACGGC GTGAAGCAGT TCATCACCAG 2020 

CGGCAAGAAC GGCCAGGTGG CGGGATCC 2048

Sequences of CPH6

SEQ ID NO.3: DNA Sequence 
SEQ ID NO.4: Protein Sequence 

Nco 1 

5' CCATGGAGCTGGCGGCCTTGTGC 
N- MetGluLeuAlaAlaLeuCys
1 

CGCTGGGGGCTCCTCCTCGCCCTCTTGCCC CCCGGAGCCGCGAGCACCCAAGTGTGCACC 
ArgTrpGlyLeuLeuLeuAlaLeuLeuPro ProGlyAlaAlaSerThrGlnValCysThr



ggatccGCCCTGGCCCAGAAGCGCGACAAC GTGCTGTTCCAGGCAGCTACCGACGAGCAG
GlySerAlaLeuAlaGlnLysArgAspAsn ValLeuPheGlnAlaAlaThrAspGluGln 
23 

CCGGCCGTGATCAAGACGCTGGAGAAGCTG GTCAACATCGAGACCGGCACCGGTGACGCC 
ProAlaValIleLysThrLeuGluLysLeu ValAsnIleGluThrGlyThrGlyAspAla



GAGGGCATCGCCGCTGCGGGCAACTTCCTC GAGGCCGAGCTCAAGAACCTCGGCTTCACG
GluGlyIleAlaAlaAlaGlyAsnPheLeu GluAlaGluLeuLysAsnLeuGlyPheThr



GTCACGCGAAGCAAGTCGGCCGGCCTGGTG GTGGGCGACAACATCGTGGGCAAGATCAAG
ValThrArgSerLysSerAlaGlyLeuVal ValGlyAspAsnIleValGlyLysIleLys



GGCCGCGGCGGCAAGAACCTGCTGCTGATG TCGCACATGGACACCGTCTACCTCAAGGGC
GlyArgGlyGlyLysAsnLeuLeuLeuMet SerHisMetAspThrValTyrLeuLysGly



ATTCTCGCGAAGGCCCCGTTCCGCGTCGAA GGCGACAAGGCCTACGGCCCGGGCATCGCC
IleLeuAlaLysAlaProPheArgValGlu GlyAspLysAlaTyrGlyProGlyIleAla



GACGACAAGGGCGGCAACGCGGTCATCCTG CACACGCTCAAGCTGCTGAAGGAATACGGC 
AspAspLysGlyGlyAsnAlaVallleLeu HisThrLeuLysLeuLeuLysGluTyrGly 

*1 

GTGCGCGACTACGGCACCATCACCGTGCTG TTCAATACCGACGAGGAAAAGGGTTCCTTC 
ValArgAspTyrGlyThrlleThrValLeu PheAsnThrAspGluGluLysGlySerPhe 



GGCTCGCGCGACCTGATCCAGGAAGAAGCC AAGCTGGCCGACTACGTGCTCTCCTTCGAG 
GlySerArgAspLeuIleGlnGluGluAla LysLeuAlaAspTyrValLeuSerPheGlu 



CCCACCAGCGCAGGCGACGAAAAACTCTCG CTGGGCACCTCGGGCATCGCCTACGTGCAG 
ProThrSerAlaGlyAspGluLysLeuSer LeuGlyThrSerGlylleAlaTyrValGln 



GTCCAAATCACCGGCAAGGCCTCGCATGCC GGCGCCGCGCCCGAGCTGGGCGTGAACGCG 
ValGlnIleThrGlyLysAlaSerHisAla GlyAlaAlaProGluLeuGlyValAsnAla

CTGGTCGAGGCTTCCGACCTCGTGCTGCGC ACGATGAACATCGACGACAAGGCGAAGAAC
LeuValGluAlaSerAspLeuValLeuArg ThrMetAsnIleAspAspLysAlaLysAsn

CTGCGCTTCCAATGGACCATCGCCAAGGCC GGCCAAGTCTCGAACATCATCCCCGCCAGC 
LeuArgPheGlnTrpThrIleAlaLysAla GlyGlnValSerAsnIleIleProAlaSer



GCCACGCTGAACGCCGACGTGCGCTACGCG CGCAACGAGGACTTCGACGCCGCCATGAAG 
AlaThrLeuAsnAlaAspValArgTyrAla ArgAsnGluAspPheAspAlaAlaMetLys



ACGCTGGAAGAGCGCGCGCAGCAGAAGAAG CTGCCCGAGGCCGACGTGAAGGTGATCGTC 
ThrLeuGluGluArgAlaGlnGlnLysLys LeuProGluAlaAspValLysValIleVal



aCgcgcggccgcccggccttcaatgccggc GAAGGCGGCAAGAAGCTGGTCGACAAGGCG 
ThrArgGlyArgProAlaPheAsnAlaGly GluGlyGlyLysLysLeuValAspLysAla



GTGGCCTACTACAAGGAAGCCGGCGGCACG CTGGGCGTGGAAGAGCGCACCGGCGGCGGC 
ValAlaTyrTyrLysGluAlaGlyGlyThr LeuGlyValGluGluArgThrGlyGlyGly 



ACCGACGCGGCCTACGCCGCGCTCTCAGGC AAGCCAGTGATCGAGAGCCTGGGCCTGCCG 
ThrAspAlaAlaTyrAlaAlaLeuSerGly LysProVallleGluSerLeuGlyLeuPro 



GGCTTCGGCTACCACAGCGACAAGGCCGAG TACGTGGACATCAGCGCGATTCCGCGCCGC 
GlyPheGlyTyrHisSerAspLysAlaGlu TyrValAspIleSerAlalleProArgArg 

Eco RI 

CTGTACATGGCTGCGCGCCTGATCATGGAT CTGGGCGCCGGCAAGgaattcCATCATCAC
LeuTyrMetAlaAlaArgLeuIleMetAsp LeuGlyAlaGlyLysGluPheHisHisHis

415 

Xba I

CACCATCACGCTTCCTAGtctaga 3'
HisHisHisAlaSer ♦ -C

SEQUENCES OF VEGF165

SEQ ID NO. 5 : DNA SEQUENCE
SEQ ID NO. 6 : PROTEIN SEQUENCE

5' ATGGACTTTCTGCTGTCTTGGGTGCATTGGAGCCTTGCCTTGCTGCTCTACCTCCACCAT

N-MetAspPheLeuLeuSerTrpValHisTrpSerLeuAlaLeuLeuLeuTyrLeuHisHis

-26 

Nco 1 

GCCAAGTGGTCCCAGGCTGCACCCATGGCA GAAGGAGGAGGGCAGAATCATCACGAAGTG 
AlaLysTrpSerGlnAlaAlaProMetAla AspGlyGlyGlyGlnAsnHisHisGluVal

-1 1 

GTGAAGTTCATGGATGTCTATCAGCGCAGC TACTGCCATCCAATCGAGACCCTGGTGGAC
ValLysPheMetAspValTyrGlnArgSer TyrCysHisProIleGluThrLeuValAsp 



ATCTTCCAGGAGTACCCTGATGAGATCGAG TACATCTTCAAGCCATCCTGTGTGCCCCTG 
IlePheGlnGluTyrProAspGluIleGlu TyrIlePheLysProSerCysValProLeu



ATGCGATGCGGGGGCTGCTGCAATGACGAG GGCCTGGAGTGTGTGCCCACTGAGGAGTCC 
MetArgCysGluGluCysCysAsnAspGlu GlyLeuAspCysValProThrGluGluSer 



AACATCACCATGCAGATTATGCGGATCAAA CCTCACCAAGGCCAGCACATAGGAGAGATG
AsnIleThrMetGlnIleMetArgIleLys ProHisGlnGlyGlnHisIleGlyGluMet

AGCTTCCTACAGCACAACAAATGTGAATGC AGACCAAAGAAAGATAGAGCAAGACAAGAA 
SerPheLeuGlnHisAsnLysCysGluCys ArgProLysLysAspArgAlaArgGlnGlu



AATCCCTGTGGGCCTTGCTCAGAGCGGAGA AAGCATTTGTTTGTACAAGATCCGCAGACG 
AsnProCysGlyProCysSerGluArgArg LysHisLeuPheValGlnAspProGlnThr 



TGTAAATGTTCCTGCAAAAACACAGACTCG CGTTGCAAGGCGAGGCAGCTTGAGTTAAAC 
CysLysCysSerCysLysAsnThrAspSer ArgCysLysAlaArgGlnLeuGluLeuAsn 



GAACGTACTTGCAGATGTGACAAGCCGAGG CGGTGA 3'
GluArgThrCysArgCysAspLysProArg Arg ♦ -C

165

SEQUENCES OF V161CPH6

SEQ ID NO. 7 : DNA SEQUENCE

SEQ ID NO. 8 : PROTEIN SEQUENCE

Nco 1

5' CCATGGACTTTCTGCTGTCTTGG
N- MetAspPheLeuLeuSerTrp

GTGCATTGGAGCCTTGCCTTGCTGCTCTAC CTCCACCATGCCAAGTGGTCCCAGGCTGCA
ValHisTrpSerLeuAlaLeuLeuLeuTyr LeuHisHisAlaLysTrpSerGlnAlaAla

CCTATGGCAGAAGGAGGAGGGCAGAATCAT CACGAAGTGGTGAAGTTCATGGATGTCTAT
ProMetAlaAspGlyGlyGlyGlnAsnHis HisGluValValLysPheMetAspValTyr

CAGCGCAGCTACTGCCATCCAATCGAGACC CTGGTGGACATCTTCCAGGAGTACCCTGAT
GlnArgSerTyrCysHisProIleGluThr LeuValAspIlePheGlnGluTyrProAsp

GAGATCGAGTACATCTTCAAGCCATCCTGT GTGCCCCTGATGCGATGCGGGGGCTGCTGC
GluIleGluTyrIlePheLysProSerCys ValProLeuMetArgCysGluGluCysCys

AATGACGAGGGCCTGGAGTGTGTGCCCACT GAGGAGTCCAACATCACCATGCAGATTATG
AsnAspGluGlyLeuAspCysValProThr GluGluSerAsnIleThrMetGlnIleMet

CGGATCAAACCTCACCAAGGCCAGCACATA GGAGAGATGAGCTTCCTACAGCACAACAAA
ArgIleLysProHisGlnGlyGlnHisIle GlyGluMetSerPheLeuGlnHisAsnLys

TGTGAATGCAGACCAAAGAAAGATAGAGCA AGACAAGAAAATCCCTGTGGGCCTTGCTCA
CysGluCysArgProLysLysAspArgAla ArgGlnGluAsnProCysGlyProCysSer

GAGCGGAGAAAGCATTTGTTTGTACAAGAT CCGCAGACGTGTAAATGTTCCTGCAAAAAC
GluArgArgLysHisLeuPheValGlnAsp ProGlnThrCysLysCysSerCysLysAsn

ACAGACTCGCGTTGCAAGGCGAGGCAGCTT GAGTTAAACGAACGTACTTGCAGATGTGAC
ThrAspSerArgCysLysAlaArgGlnLeu GluLeuAsnGluArgThrCysArgCysAsp

ggatccGCCCTGGCCCAGAAGCGCGACAAC GTGCTGTTCCAGGCAGCTACCGACGAGCAG
GlySerAlaLeuAlaGlnLysArgAspAsn ValLeuPheGlnAlaAlaThrAspGluGln 

CCGGCCGTGATCAAGACGCTGGAGAAGCTG GTCAACATCGAGACCGGCACCGGTGACGCC 
ProAlaVallleLysThrLeuGluLysLeu ValAsnlleGluThrGlyThrGlyAspAla 

GAGGGCATCGCCGCTGCGGGCAACTTCCTC GAGGCCGAGCTCAAGAACCTCGGCTTCACG 
GluGlylleAlaAlaAlaGlyAsnPheLeu GluAlaGluLeuLysAsnLeuGlyPheThr 

GTCACGCGAAGCAAGTCGGCCGGCCTGGTG GTGGGCGACAACATCGTGGGCAAGATCAAG 
ValThrArgSerLysSerAlaGlyLeuVal ValGlyAspAsnlleValGlyLysIleLys 

GGCCGCGGCGGCAAGAACCTGCTGCTGATG TCGCACATGGACACCGTCTACCTCAAGGGC 
GlyArgGlyGlyLysAsnLeuLeuLeuMet SerHisMetAspThrValTyrLeuLysGly 

ATTCTCGCGAAGGCCCCGTTCCGCGTCGAA GGCGACAAGGCCTACGGCCCGGGCATCGCC 
IleLeuAlaLysAlaProPheArgValGlu GlyAspLysAlaTyrGlyProGlyIleAla

GACGACAAGGGCGGCAACGCGGTCATCCTG CACACGCTCAAGCTGCTGAAGGAATACGGC
AspAspLysGlyGlyAsnAlaValIleLeu HisThrLeuLysLeuLeuLysGluTyrGly

GTGCGCGACTACGGCACCATCACCGTGCTG TTCAATACCGACGAGGAAAAGGGTTCCTTC 
ValArgAspTyrGlyThrlleThrValLeu PheAsnThrAspGluGluLysGlySerPhe 

GGCTCGCGCGACCTGATCCAGGAAGAAGCC AAGCTGGCCGACTACGTGCTCTCCTTCGAG 
GlySerArgAspLeuIleGlnGluGluAla LysLeuAlaAspTyrValLeuSerPheGlu 

CCCACCAGCGCAGGCGACGAAAAACTCTCG CTGGGCACCTCGGGCATCGCCTACGTGCAG 
ProThrSerAlaGlyAspGluLysLeuSer LeuGlyThrSerGlylleAlaTyrValGln 

GTCCAAATCACCGGCAAGGCCTCGCATGCC GGCGCCGCGCCCGAGCTGGGCGTGAACGCG 
ValGlnlleThrGlyLysAlaSerHisAla GlyAlaAlaProGluLeuGlyValAsnAla 

CTGGTCGAGGCTTCCGACCTCGTGCTGCGC ACGATGAACATCGACGACAAGGCGAAGAAC 
LeuValGluAlaSerAspLeuValLeuArg ThrMetAsnIleAspAspLysAlaLysAsn

CTGCGCTTCCAATGGACCATCGCCAAGGCC GGCCAAGTCTCGAACATCATCCCCGCCAGC
LeuArgPheGlnTrpThrlleAlaLysAla GlyGlnValSerAsnllelleProAlaSer 

GCCACGCTGAACGCCGACGTGCGCTACGCG CGCAACGAGGACTTCGACGCCGCCATGAAG 
AlaThrLeuAsnAlaAspValArgTyrAla ArgAsnGluAspPheAspAlaAlaMetLys

ACGCTGGAAGAGCGCGCGCAGCAGAAGAAG CTGCCCGAGGCCGACGTGAAGGTGATCGTC 
ThrLeuGluGluArgAlaGlnGlnLysLys LeuProGluAlaAspValLysVallleVal 

ACGCGCGGCCGCCCGGCCTTCAATGCCGGC GAAGGCGGCAAGAAGCTGGTCGACAAGGCG 
ThrArgGlyArgProAlaPheAsnAlaGly GluGlyGlyLysLysLeuValAspLysAla

GTGGCCTACTACAAGGAAGCCGGCGGCACG CTGGGCGTGGAAGAGCGCACCGGCGGCGGC 
ValAlaTyrTyrLysGluAlaGlyGlyThr LeuGlyValGluGluArgThrGlyGlyGly 

ACCGACGCGGCCTACGCCGCGCTCTCAGGC AAGCCAGTGATCGAGAGCCTGGGCCTGCCG 
ThrAspAlaAlaTyrAlaAlaLeuSerGly LysProValIleGluSerLeuGlyLeuPro

GGCTTCGGCTACCACAGCGACAAGGCCGAG TACGTGGACATCAGCGCGATTCCGCGCCGC 
GlyPheGlyTyrHisSerAspLysAlaGlu TyrValAspIleSerAlalleProArgArg 

Eco RI 

CTGTACATGGCTGCGCGCCTGATCATGGAT CTGGGCGCCGGCAAGgaattcCATCATCAC
LeuTyrMetAlaAlaArgLeuIleMetAsp LeuGlyAlaGlyLysGluPheHisHisHis

415

Xba I

CACCATCACGCTTCCTAGtctaga 3'
HisHisHisAlaSer • -C



WO 97/26918 






PCT/GB97/00221 



SEQUENCES OF V115CPH6

SEQ ID NO. 9 : DNA SEQUENCE

SEQ ID NO. 10 : PROTEIN SEQUENCE

Nco 1

5' CCATG
N- Met 

-26 

GACTTTCTGCTGTCTTGGGTGCATTGGAGC CTTGCCTTGCTGCTCTACCTCCACCATGCC 
AspPheLeuLeuSerTrpValHisTrpSer LeuAlaLeuLeuLeuTyrLeuHisHisAla

AAGTGGTCCCAGGCTGCACCTATGGCAGAA GGAGGAGGGCAGAATCATCACGAAGTGGTG 

LysTrpSerGlnAlaAlaProMetAlaAsp GlyGlyGlyGlnAsnHisHisGluValVal

-1 1

AAGTTCATGGATGTCTATCAGCGCAGCTAC TGCCATCCAATCGAGACCCTGGTGGACATC
LysPheMetAspValTyrGlnArgSerTyr CysHisProIleGluThrLeuValAspIle

TTCCAGGAGTACCCTGATGAGATCGAGTAC ATCTTCAAGCCATCCTGTGTGCCCCTGATG 
PheGlnGluTyrProAspGluIleGluTyr IlePheLysProSerCysValProLeuMet

CGATGCGGGGGCTGCTGCAATGACGAGGGC CTGGAGTGTGTGCCCACTGAGGAGTCCAAC 
ArgCysGluGluCysCysAsnAspGluGly LeuAspCysValProThrGluGluSerAsn

ATCACCATGCAGATTATGCGGATCAAACCT CACCAAGGCCAGCACATAGGAGAGATGAGC 
IleThrMetGlnIleMetArgIleLysPro HisGlnGlyGlnHisIleGlyGluMetSer

TTCCTACAGCACAACAAATGTGAATGCAGA CCAAAGAAAGATAGAGCAAGACAAGAAAAT 
PheLeuGlnHisAsnLysCysGluCysArg ProLysLysAspArgAlaArgGlnGluAsn

115

Bam HI

ggatccGCCCTGGCCCAGAAGCGCGACAAC GTGCTGTTCCAGGCAGCTACCGACGAGCAG 
GlySerAlaLeuAlaGlnLysArgAspAsn ValLeuPheGlnAlaAlaThrAspGluGln 

CCGGCCGTGATCAAGACGCTGGAGAAGCTG GTCAACATCGAGACCGGCACCGGTGACGCC
ProAlaVallleLysThrLeuGluLysLeu ValAsnlleGluThrGlyThrGlyAspAla 

GAGGGCATCGCCGCTGCGGGCAACTTCCTC GAGGCCGAGCTCAAGAACCTCGGCTTCACG 
GluGlylleAlaAlaAlaGlyAsnPheLeu GluAlaGluLeuLysAsnLeuGlyPheThr 

GTCACGCGAAGCAAGTCGGCCGGCCTGGTG GTGGGCGACAACATCGTGGGCAAGATCAAG 
ValThrArgSerLysSerAlaGlyLeuVal ValGlyAspAsnlleValGlyLysIleLys 

GGCCGCGGCGGCAAGAACCTGCTGCTGATG TCGCACATGGACACCGTCTACCTCAAGGGC 
GlyArgGlyGlyLysAsnLeuLeuLeuMet SerHisMetAspThrValTyrLeuLysGly 

ATTCTCGCGAAGGCCCCGTTCCGCGTCGAA GGCGACAAGGCCTACGGCCCGGGCATCGCC 
IleLeuAlaLysAlaProPheArgValGlu GlyAspLysAlaTyrGlyProGlyIleAla

GACGACAAGGGCGGCAACGCGGTCATCCTG CACACGCTCAAGCTGCTGAAGGAATACGGC 
AspAspLysGlyGlyAsnAlaValIleLeu HisThrLeuLysLeuLeuLysGluTyrGly

GTGCGCGACTACGGCACCATCACCGTGCTG TTCAATACCGACGAGGAAAAGGGTTCCTTC
ValArgAspTyrGlyThrIleThrValLeu PheAsnThrAspGluGluLysGlySerPhe

GGCTCGCGCGACCTGATCCAGGAAGAAGCC AAGCTGGCCGACTACGTGCTCTCCTTCGAG 
GlySerArgAspLeuIleGlnGluGluAla LysLeuAlaAspTyrValLeuSerPheGlu

CCCACCAGCGCAGGCGACGAAAAACTCTCG CTGGGCACCTCGGGCATCGCCTACGTGCAG 
ProThrSerAlaGlyAspGluLysLeuSer LeuGlyThrSerGlylleAlaTyrValGln 

GTCCAAATCACCGGCAAGGCCTCGCATGCC GGCGCCGCGCCCGAGCTGGGCGTGAACGCG 
ValGlnIleThrGlyLysAlaSerHisAla GlyAlaAlaProGluLeuGlyValAsnAla

CTGGTCGAGGCTTCCGACCTCGTGCTGCGC ACGATGAACATCGACGACAAGGCGAAGAAC 
LeuValGluAlaSerAspLeuValLeuArg ThrMetAsnIleAspAspLysAlaLysAsn

CTGCGCTTCCAATGGACCATCGCCAAGGCC GGCCAAGTCTCGAACATCATCCCCGCCAGC 
LeuArgPheGlnTrpThrIleAlaLysAla GlyGlnValSerAsnIleIleProAlaSer

GCCACGCTGAACGCCGACGTGCGCTACGCG CGCAACGAGGACTTCGACGCCGCCATGAAG 
AlaThrLeuAsnAlaAspValArgTyrAla ArgAsnGluAspPheAspAlaAlaMetLys 

ACGCTGGAAGAGCGCGCGCAGCAGAAGAAG CTGCCCGAGGCCGACGTGAAGGTGATCGTC 
ThrLeuGluGluArgAlaGlnGlnLysLys LeuProGluAlaAspValLysVallleVal 

ACGCGCGGCCGCCCGGCCTTCAATGCCGGC GAAGGCGGCAAGAAGCTGGTCGACAAGGCG 
ThrArgGlyArgProAlaPheAsnAlaGly GluGlyGlyLysLysLeuValAspLysAla 

GTGGCCTACTACAAGGAAGCCGGCGGCACG CTGGGCGTGGAAGAGCGCACCGGCGGCGGC 
ValAlaTyrTyrLysGluAlaGlyGlyThr LeuGlyValGluGluArgThrGlyGlyGly 

ACCGACGCGGCCTACGCCGCGCTCTCAGGC AAGCCAGTGATCGAGAGCCTGGGCCTGCCG 
ThrAspAlaAlaTyrAlaAlaLeuSerGly LysProValIleGluSerLeuGlyLeuPro

GGCTTCGGCTACCACAGCGACAAGGCCGAG TACGTGGACATCAGCGCGATTCCGCGCCGC 
GlyPheGlyTyrHisSerAspLysAlaGlu TyrValAspIleSerAlalleProArgArg 

Eco RI 

CTGTACATGGCTGCGCGCCTGATCATGGAT CTGGGCGCCGGCAAGgaattcCATCATCAC
LeuTyrMetAlaAlaArgLeuIleMetAsp LeuGlyAlaGlyLysGluPheHisHisHis

Xba I

CACCATCACGCTTCCTAGtctaga 3'
HisHisHisAlaSer • -C
Nco I

5' CCATGGAGCTGGCGGCCTTGTGC

1 

CGCTGGGGGCTCCTCCTCGCCCTCTTGCCC CCCGGAGCCGCGAGCACCCAAGTGTGCACC 
ArgTrpGlyLeuLeuLeuAlaLeuLeuPro ProGlyAlaAlaSerThrGlnValCysThr

27 

Bam HI

ggatccGCCCTGGCCCAGAAGCGCGACAAC GTGCTGTTCCAGGCAGCTACCGACGAGCAG
GlySerAlaLeuAlaGlnLysArgAspAsn ValLeuPheGlnAlaAlaThrAspGluGln 

23 

CCGGCCGTGATCAAGACGCTGGAGAAGCTG GTCAACATCGAGACCGGCACCGGTGACGCC 
ProAlaValIleLysThrLeuGluLysLeu ValAsnIleGluThrGlyThrGlyAspAla

GAGGGCATCGCCGCTGCGGGCAACTTCCTC GAGGCCGAGCTCAAGAACCTCGGCTTCACG 
GluGlylleAlaAlaAlaGlyAsnPheLeu GluAlaGluLeuLysAsnLeuGlyPheThr 

GTCACGCGAAGCAAGTCGGCCGGCCTGGTG GTGGGCGACAACATCGTGGGCAAGATCAAG 
ValThrArgSerLysSerAlaGlyLeuVal ValGlyAspAsnlleValGlyLysIleLys 

GGCCGCGGCGGCAAGAACCTGCTGCTGATG TCGCACATGGACACCGTCTACCTCAAGGGC 
GlyArgGlyGlyLysAsnLeuLeuLeuMet SerHisMetAspThrValTyrLeuLysGly

ATTCTCGCGAAGGCCCCGTTCCGCGTCGAA GGCGACAAGGCCTACGGCCCGGGCATCGCC
IleLeuAlaLysAlaProPheArgValGlu GlyAspLysAlaTyrGlyProGlyIleAla

GACGACAAGGGCGGCAACGCGGTCATCCTG CACACGCTCAAGCTGCTGAAGGAATACGGC 
AspAspLysGlyGlyAsnAlaValIleLeu HisThrLeuLysLeuLeuLysGluTyrGly

GTGCGCGACTACGGCACCATCACCGTGCTG TTCAATACCGACGAGGAAAAGGGTTCCTTC 
ValArgAspTyrGlyThrlleThrValLeu PheAsnThrAspGluGluLysGlySerPhe 

GGCTCGCGCGACCTGATCCAGGAAGAAGCC AAGCTGGCCGACTACGTGCTCTCCTTCGAG 
GlySerArgAspLeuIleGlnGluGluAla LysLeuAlaAspTyrValLeuSerPheGlu 

CCCACCAGCGCAGGCGACGAAAAACTCTCG CTGGGCACCTCGGGCATCGCCTACGTGCAG 
ProThrSerAlaGlyAspGluLysLeuSer LeuGlyThrSerGlyIleAlaTyrValGln

GTCCAAATCACCGGCAAGGCCTCGCATGCC GGCGCCGCGCCCGAGCTGGGCGTGAACGCG 
ValGlnlleThrGlyLysAlaSerHisAla GlyAlaAlaProGluLeuGlyValAsnAla 

CTGGTCGAGGCTTCCGACCTCGTGCTGCGC ACGATGAACATCGACGACAAGGCGAAGAAC 
LeuValGluAlaSerAspLeuValLeuArg ThrMetAsnlleAspAspLysAlaLysAsn 

CTGCGCTTCCAATGGACCATCGCCAAGGCC GGCCAAGTCTCGAACATCATCCCCGCCAGC 
LeuArgPheGlnTrpThrlleAlaLysAla GlyGlnValSerAsnllelleProAlaSer 

GCCACGCTGAACGCCGACGTGCGCTACGCG CGCAACGAGGACTTCGACGCCGCCATGAAG 
AlaThrLeuAsnAlaAspValArgTyrAla ArgAsnGluAspPheAspAlaAlaMetLys 
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ThrLeuGluGluArgAlaGlnGlnLysLys LeuProGluAlaAspValLysVallleVal 

ACGCGCGGCCGCCCGGCCTTCAATGCCGGC GAAGGCGGCAAGAAGCTGGTCGACAAGGCG
ThrArgGlyArgProAlaPheAsnAlaGly GluGlyGlyLysLysLeuValAspLysAla

GTGGCCTACTACAAGGAAGCCGGCGGCACG CTGGGCGTGGAAGAGCGCACCGGCGGCGGC 
ValAlaTyrTyrLysGluAlaGlyGlyThr LeuGlyValGluGluArgThrGlyGlyGly 

ACCGACGCGGCCTACGCCGCGCTCTCAGGC AAGCCAGTGATCGAGAGCCTGGGCCTGCCG 
ThrAspAlaAlaTyrAlaAlaLeuSerGly LysProValIleGluSerLeuGlyLeuPro

GGCTTCGGCTACCACAGCGACAAGGCCGAG TACGTGGACATCAGCGCGATTCCGCGCCGC 
GlyPheGlyTyrHisSerAspLysAlaGlu TyrValAspIleSerAlaIleProArgArg



CTGTACATGGCTGCGCGCCTGATCATGGAT CTGGGCGCCGGCAAGgaattcGGAGGTGGC
LeuTyrMetAlaAlaArgLeuIleMetAsp LeuGlyAlaGlyLysGluPhe GlyGlyGly



GlyGlyThr AlaProMetAlaAspGlyGly GlyGlnAsnHisHisGluValValLysPhe




ATGGATGTCTATCAGCGCAGCTACTGCCAT 
MetAspValTyrGlnArgSerTyrCysHis 



CCAATCGAGACCCTGGTGGACATCTTCCAG 
ProIleGluThrLeuValAspIlePheGln 



GAGTACCCTGATGAGATCGAGTACATCTTC 
GluTyrProAspGluIleGluTyrllePhe 



AAGCCATCCTGTGTGCCCCTGATGCGATGC 
LysProSerCysValProLeuMetArgCys



GGGGGCTGCTGCAATGACGAGGGCCTGGAG 
GluGluCysCysAsnAspGluGlyLeuAsp



TGTGTGCCCACTGAGGAGTCCAACATCACC 
CysValProThrGluGluSerAsnlleThr 



ATGCAGATTATGCGGATCAAACCTCACCAA 
MetGlnIleMetArgIleLysProHisGln



GGCCAGCACATAGGAGAGATGAGCTTCCTA 
GlyGlnHisIleGlyGluMetSerPheLeu 



CAGCACAACAAATGTGAATGCAGACCAAAG 
GlnHisAsnLysCysGluCysArgProLys



AAAGATAGAGCAAGACAAGAAAATCCCTGT 
LysAspArgAlaArgGlnGluAsnProCys



GGGCCTTGCTCAGAGCGGAGAAAGCATTTG 
GlyProCysSerGluArgArgLysHisLeu



TTTGTACAAGATCCGCAGACGTGTAAATGT 
PheValGlnAspProGlnThrCysLysCys



TCCTGCAAAAACACAGACTCGCGTTGCAAG 
SerCysLysAsnThrAspSerArgCysLys



GCGAGGCAGCTTGAGTTAAACGAACGTACT 
AlaArgGlnLeuGluLeuAsnGluArgThr



TGCAGATGTGACAAGCCGAGGCGGTGA 3 ' 
CysArgCysAspLysProArgArg • -c 



165 
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Sequences of CPV 109 

SEQ ID NO. 13: DNA Sequence 
SEQ ID NO. 14: Protein Sequence 

Nco I

5' CCATGGAGCTGGCGGCCTTGTGC
N- MetGluLeuAlaAlaLeuCys
1

CGCTGGGGGCTCCTCCTCGCCCTCTTGCCC CCCGGAGCCGCGAGCACCCAAGTGTGCACC
ArgTrpGlyLeuLeuLeuAlaLeuLeuPro ProGlyAlaAlaSerThrGlnValCysThr



ggatccGCCCTGGCCCAGAAGCGCGACAAC GTGCTGTTCCAGGCAGCTACCGACGAGCAG
GlySerAlaLeuAlaGlnLysArgAspAsn ValLeuPheGlnAlaAlaThrAspGluGln
23 

CCGGCCGTGATCAAGACGCTGGAGAAGCTG GTCAACATCGAGACCGGCACCGGTGACGCC 
ProAlaValIleLysThrLeuGluLysLeu ValAsnIleGluThrGlyThrGlyAspAla

GAGGGCATCGCCGCTGCGGGCAACTTCCTC GAGGCCGAGCTCAAGAACCTCGGCTTCACG 
GluGlyIleAlaAlaAlaGlyAsnPheLeu GluAlaGluLeuLysAsnLeuGlyPheThr

GTCACGCGAAGCAAGTCGGCCGGCCTGGTG GTGGGCGACAACATCGTGGGCAAGATCAAG
ValThrArgSerLysSerAlaGlyLeuVal ValGlyAspAsnIleValGlyLysIleLys

GGCCGCGGCGGCAAGAACCTGCTGCTGATG TCGCACATGGACACCGTCTACCTCAAGGGC 
GlyArgGlyGlyLysAsnLeuLeuLeuMet SerHisMetAspThrValTyrLeuLysGly 

ATTCTCGCGAAGGCCCCGTTCCGCGTCGAA GGCGACAAGGCCTACGGCCCGGGCATCGCC 
IleLeuAlaLysAlaProPheArgValGlu GlyAspLysAlaTyrGlyProGlylleAla 

GACGACAAGGGCGGCAACGCGGTCATCCTG CACACGCTCAAGCTGCTGAAGGAATACGGC 
AspAspLysGlyGlyAsnAlaValIleLeu HisThrLeuLysLeuLeuLysGluTyrGly

GTGCGCGACTACGGCACCATCACCGTGCTG TTCAATACCGACGAGGAAAAGGGTTCCTTC 
ValArgAspTyrGlyThrIleThrValLeu PheAsnThrAspGluGluLysGlySerPhe

GGCTCGCGCGACCTGATCCAGGAAGAAGCC AAGCTGGCCGACTACGTGCTCTCCTTCGAG 
GlySerArgAspLeuIleGlnGluGluAla LysLeuAlaAspTyrValLeuSerPheGlu 

CCCACCAGCGCAGGCGACGAAAAACTCTCG CTGGGCACCTCGGGCATCGCCTACGTGCAG 
ProThrSerAlaGlyAspGluLysLeuSer LeuGlyThrSerGlylleAlaTyrValGln 

GTCCAAATCACCGGCAAGGCCTCGCATGCC GGCGCCGCGCCCGAGCTGGGCGTGAACGCG 
ValGlnlleThrGlyLysAlaSerHisAla GlyAlaAlaProGluLeuGlyValAsnAla 

CTGGTCGAGGCTTCCGACCTCGTGCTGCGC ACGATGAACATCGACGACAAGGCGAAGAAC 
LeuValGluAlaSerAspLeuValLeuArg ThrMetAsnlleAspAspLysAlaLysAsn 

CTGCGCTTCCAATGGACCATCGCCAAGGCC GGCCAAGTCTCGAACATCATCCCCGCCAGC 
LeuArgPheGlnTrpThrlleAlaLysAla GlyGlnValSerAsnllelleProAlaSer 

GCCACGCTGAACGCCGACGTGCGCTACGCG CGCAACGAGGACTTCGACGCCGCCATGAAG 
AlaThrLeuAsnAlaAspValArgTyrAla ArgAsnGluAspPheAspAlaAlaMetLys
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ACGCTGGAAGAGCGCGCGCAGCAGAAGAAG CTGCCCGAGGCCGACGTGAAGGTGATCGTC 
ThrLeuGluGluArgAlaGlnGlnLysLys LeuProGluAlaAspValLysValIleVal

ACGCGCGGCCGCCCGGCCTTCAATGCCGGC GAAGGCGGCAAGAAGCTGGTCGACAAGGCG 
ThrArgGlyArgProAlaPheAsnAlaGly GluGlyGlyLysLysLeuValAspLysAla 

GTGGCCTACTACAAGGAAGCCGGCGGCACG CTGGGCGTGGAAGAGCGCACCGGCGGCGGC 
ValAlaTyrTyrLysGluAlaGlyGlyThr LeuGlyValGluGluArgThrGlyGlyGly 

ACCGACGCGGCCTACGCCGCGCTCTCAGGC AAGCCAGTGATCGAGAGCCTGGGCCTGCCG 
ThrAspAlaAlaTyrAlaAlaLeuSerGly LysProVallleGluSerLeuGlyLeuPro 

GGCTTCGGCTACCACAGCGACAAGGCCGAG TACGTGGACATCAGCGCGATTCCGCGCCGC 
GlyPheGlyTyrHisSerAspLysAlaGlu TyrValAspIleSerAlaIleProArgArg



                               Eco RI
CTGTACATGGCTGCGCGCCTGATCATGGAT CTGGGCGCCGGCAAGgaattcGGAGGTGGC
LeuTyrMetAlaAlaArgLeuIleMetAsp LeuGlyAlaGlyLysGluPhe GlyGlyGly
                               415

Kpn I
GGAggtaccgcacccatggcagaaggagga GGGCAGAATCATCACGAAGTGGTGAAGTTC
GlyGlyThr AlaProMetAlaAspGlyGly GlyGlnAsnHisHisGluValValLysPhe
1

ATGGATGTCTATCAGCGCAGCTACTGCCAT CCAATCGAGACCCTGGTGGACATCTTCCAG
MetAspValTyrGlnArgSerTyrCysHis ProIleGluThrLeuValAspIlePheGln

GAGTACCCTGATGAGATCGAGTACATCTTC AAGCCATCCTGTGTGCCCCTGATGCGATGC
GluTyrProAspGluIleGluTyrIlePhe LysProSerCysValProLeuMetArgCys

GGGGGCTGCTGCAATGACGAGGGCCTGGAG TGTGTGCCCACTGAGGAGTCCAACATCACC
GluGluCysCysAsnAspGluGlyLeuAsp CysValProThrGluGluSerAsnIleThr

ATGCAGATTATGCGGATCAAACCTCACCAA GGCCAGCACATAGGAGAGATGAGCTTCCTA
MetGlnIleMetArgIleLysProHisGln GlyGlnHisIleGlyGluMetSerPheLeu

★

CAGCACAACAAATGTGAATGCAGACCAAAG AAAGATTGAGCAAGACAAGAAAATCCCTGT
GlnHisAsnLysCysGluCysArgProLys LysAsp * -C

109 



GGGCCTTGCTCAGAGCGGAGAAAGCATTTG TTTGTACAAGATCCGCAGAC GTGTAAATGT 



TCCTGCAAAAACACAGACTCGCGTTGCAAG GCGAGGCAGCTTGAGTTAAACGAACGTACT 



TGCAGATGTGACAAGCCGAGGCGGTGA 3'
SEQUENCE ID NOS. 15 and 16

Nco I

5' CCATGGAGCTGGCGGCCTTGTGC
N- MetGluLeuAlaAlaLeuCys
1 

CGCTGGGGGCTCCTCCTCGCCCTCTTGCCC CCCGGAGCCGCGAGCACCCAAGTGTGCACC
ArgTrpGlyLeuLeuLeuAlaLeuLeuPro ProGlyAlaAlaSerThrGlnValCysThr

27 

Bam HI

ggatccGCCCTGGCCCAGAAGCGCGACAAC GTGCTGTTCCAGGCAGCTACCGACGAGCAG 
GlySerAlaLeuAlaGlnLysArgAspAsn ValLeuPheGlnAlaAlaThrAspGluGln 

CCGGCCGTGATCAAGACGCTGGAGAAGCTG GTCAACATCGAGACCGGCACCGGTGACGCC
ProAlaValIleLysThrLeuGluLysLeu ValAsnIleGluThrGlyThrGlyAspAla

GAGGGCATCGCCGCTGCGGGCAACTTCCTC GAGGCCGAGCTCAAGAACCTCGGCTTCACG 
GluGlyIleAlaAlaAlaGlyAsnPheLeu GluAlaGluLeuLysAsnLeuGlyPheThr

GTCACGCGAAGCAAGTCGGCCGGCCTGGTG GTGGGCGACAACATCGTGGGCAAGATCAAG
ValThrArgSerLysSerAlaGlyLeuVal ValGlyAspAsnIleValGlyLysIleLys

GGCCGCGGCGGCAAGAACCTGCTGCTGATG TCGCACATGGACACCGTCTACCTCAAGGGC
GlyArgGlyGlyLysAsnLeuLeuLeuMet SerHisMetAspThrValTyrLeuLysGly

ATTCTCGCGAAGGCCCCGTTCCGCGTCGAA GGCGACAAGGCCTACGGCCCGGGCATCGCC 
IleLeuAlaLysAlaProPheArgValGlu GlyAspLysAlaTyrGlyProGlyIleAla

GACGACAAGGGCGGCAACGCGGTCATCCTG CACACGCTCAAGCTGCTGAAGGAATACGGC 
AspAspLysGlyGlyAsnAlaVallleLeu HisThrLeuLysLeuLeuLysGluTyrGly 

GTGCGCGACTACGGCACCATCACCGTGCTG TTCAATACCGACGAGGAAAAGGGTTCCTTC 
ValArgAspTyrGlyThrIleThrValLeu PheAsnThrAspGluGluLysGlySerPhe

GGCTCGCGCGACCTGATCCAGGAAGAAGCC AAGCTGGCCGACTACGTGCTCTCCTTCGAG 

GlySerArgAspLeuIleGlnGluGluAla LysLeuAlaAspTyrValLeuSerPheGlu

CCCACCAGCGCAGGCGACGAAAAACTCTCG CTGGGCACCTCGGGCATCGCCTACGTGCAG 
ProThrSerAlaGlyAspGluLysLeuSer LeuGlyThrSerGlylleAlaTyrValGln 

GTCCAAATCACCGGCAAGGCCTCGCATGCC GGCGCCGCGCCCGAGCTGGGCGTGAACGCG 
ValGlnlleThrGlyLysAlaSerHisAla GlyAlaAlaProGluLeuGlyValAsnAla 

CTGGTCGAGGCTTCCGACCTCGTGCTGCGC ACGATGAACATCGACGACAAGGCGAAGAAC 
LeuValGluAlaSerAspLeuValLeuArg ThrMetAsnlleAspAspLysAlaLysAsn 

CTGCGCTTCCAATGGACCATCGCCAAGGCC GGCCAAGTCTCGAACATCATCCCCGCCAGC
LeuArgPheGlnTrpThrlleAlaLysAla GlyGlnValSerAsnllelleProAlaSer 

GCCACGCTGAACGCCGACGTGCGCTACGCG CGCAACGAGGACTTCGACGCCGCCATGAAG 
AlaThrLeuAsnAlaAspValArgTyrAla ArgAsnGluAspPheAspAlaAlaMetLys 
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ACGCTGGAAGAGCGCGCGCAGCAGAAGAAG CTGCCCGAGGCCGACGTGAAGGTGATCGTC 
ThrLeuGluGluArgAlaGlnGlnLysLys LeuProGluAlaAspValLysVallleVal 

ACGCGCGGCCGCCCGGCCTTCAATGCCGGC GAAGGCGGCAAGAAGCTGGTCGACAAGGCG 
ThrArgGlyArgProAlaPheAsnAlaGly GluGlyGlyLysLysLeuValAspLysAla 

GTGGCCTACTACAAGGAAGCCGGCGGCACG CTGGGCGTGGAAGAGCGCACCGGCGGCGGC 
ValAlaTyrTyrLysGluAlaGlyGlyThr LeuGlyValGluGluArgThrGlyGlyGly 

ACCGACGCGGCCTACGCCGCGCTCTCAGGC AAGCCAGTGATCGAGAGCCTGGGCCTGCCG 
ThrAspAlaAlaTyrAlaAlaLeuSerGly LysProVallleGluSerLeuGlyLeuPro 

GGCTTCGGCTACCACAGCGACAAGGCCGAG TACGTGGACATCAGCGCGATTCCGCGCCGC 
GlyPheGlyTyrHisSerAspLysAlaGlu TyrValAspIleSerAlalleProArgArg 

Eco RI 

CTGTACATGGCTGCGCGCCTGATCATGGAT CTGGGCGCCGGCAAGgaattcGGAGGTGGC
LeuTyrMetAlaAlaArgLeuIleMetAsp LeuGlyAlaGlyLysGluPhe GlyGlyGly

415 

Kpn I

GGAggtacCGCACCCATGGCAGAAGGAGGA GGGCAGAATCATCACGAAGTGGTGAAGTTC
GlyGlyThr AlaProMetAlaAspGlyGly GlyGlnAsnHisHisGluValValLysPhe

ATGGATGTCTATCAGCGCAGCTACTGCCAT CCAATCGAGACCCTGGTGGACATCTTCCAG
MetAspValTyrGlnArgSerTyrCysHis ProIleGluThrLeuValAspIlePheGln

GAGTACCCTGATGAGATCGAGTACATCTTC AAGCCATCCTGTGTGCCCCTGATGCGATGC 
GluTyrProAspGluIleGluTyrIlePhe LysProSerCysValProLeuMetArgCys

GGGGGCTGCTGCAATGACGAGGGCCTGGAG TGTGTGCCCACTGAGGAGTCCAACATCACC 
GluGluCysCysAsnAspGluGlyLeuAsp CysValProThrGluGluSerAsnIleThr

ATGCAGATTATGCGGATCAAACCTCACCAA GGCCAGCACATAGGAGAGATGAGCTTCCTA 
MetGlnIleMetArgIleLysProHisGln GlyGlnHisIleGlyGluMetSerPheLeu

CAGCACAACAAATGTGAATGCAGACCAAAG AAAGATAGAGCAAGACAAGAAAATCCCTGT
GlnHisAsnLysCysGluCysArgProLys LysAspArgAlaArgGlnGluAsnProCys

GGGCCTTGCTCAGAGCGGAGAAAGCATTTG TTTGTACAAGATCCGCAGACGTGTAAATGT 
GlyProCysSerGluArgArgLysHisLeu PheValGlnAspProGlnThrCysLysCys

TCCTGCAAAAACACAGACTCGCGTTGCAAG GCGAGGCAGCTTGAGTTAAACGAACGTACT 
SerCysLysAsnThrAspSerArgCysLys AlaArgGlnLeuGluLeuAsnGluArgThr

Bam HI Eco RI Xba I

TGCAGATGTGAcggattcgaattcCATCAT CACCACCATCACGCTTCCTAGtctaga 3'
CysArgCysAspGlySerGluPheHisHis HisHisHisHisAlaSer • -C
161 
SEQUENCE ID NOS. 17 and 18

Nco I

5' CCATGGAGCTGGCGGCCTTGTGC
N- MetGluLeuAlaAlaLeuCys
1

CGCTGGGGGCTCCTCCTCGCCCTCTTGCCC CCCGGAGCCGCGAGCACCCAAGTGTGCACC
ArgTrpGlyLeuLeuLeuAlaLeuLeuPro ProGlyAlaAlaSerThrGlnValCysThr

27 

Bam HI 

ggatccGCCCTGGCCCAGAAGCGCGACAAC GTGCTGTTCCAGGCAGCTACCGACGAGCAG
GlySerAlaLeuAlaGlnLysArgAspAsn ValLeuPheGlnAlaAlaThrAspGluGln 
23 

CCGGCCGTGATCAAGACGCTGGAGAAGCTG GTCAACATCGAGACCGGCACCGGTGACGCC 
ProAlaValIleLysThrLeuGluLysLeu ValAsnIleGluThrGlyThrGlyAspAla

GAGGGCATCGCCGCTGCGGGCAACTTCCTC GAGGCCGAGCTCAAGAACCTCGGCTTCACG 
GluGlyIleAlaAlaAlaGlyAsnPheLeu GluAlaGluLeuLysAsnLeuGlyPheThr

GTCACGCGAAGCAAGTCGGCCGGCCTGGTG GTGGGCGACAACATCGTGGGCAAGATCAAG 
ValThrArgSerLysSerAlaGlyLeuVal ValGlyAspAsnlleValGlyLysIleLys 

GGCCGCGGCGGCAAGAACCTGCTGCTGATG TCGCACATGGACACCGTCTACCTCAAGGGC
GlyArgGlyGlyLysAsnLeuLeuLeuMet SerHisMetAspThrValTyrLeuLysGly

ATTCTCGCGAAGGCCCCGTTCCGCGTCGAA GGCGACAAGGCCTACGGCCCGGGCATCGCC 
IleLeuAlaLysAlaProPheArgValGlu GlyAspLysAlaTyrGlyProGlyIleAla

GACGACAAGGGCGGCAACGCGGTCATCCTG CACACGCTCAAGCTGCTGAAGGAATACGGC 
AspAspLysGlyGlyAsnAlaVallleLeu HisThrLeuLysLeuLeuLysGluTyrGly 

GTGCGCGACTACGGCACCATCACCGTGCTG TTCAATACCGACGAGGAAAAGGGTTCCTTC
ValArgAspTyrGlyThrIleThrValLeu PheAsnThrAspGluGluLysGlySerPhe

GGCTCGCGCGACCTGATCCAGGAAGAAGCC AAGCTGGCCGACTACGTGCTCTCCTTCGAG 
GlySerArgAspLeuIleGlnGluGluAla LysLeuAlaAspTyrValLeuSerPheGlu 

CCCACCAGCGCAGGCGACGAAAAACTCTCG CTGGGCACCTCGGGCATCGCCTACGTGCAG 
ProThrSerAlaGlyAspGluLysLeuSer LeuGlyThrSerGlylleAlaTyrValGln 

GTCCAAATCACCGGCAAGGCCTCGCATGCC GGCGCCGCGCCCGAGCTGGGCGTGAACGCG 
ValGlnlleThrGlyLysAlaSerHisAla GlyAlaAlaProGluLeuGlyValAsnAla 

CTGGTCGAGGCTTCCGACCTCGTGCTGCGC ACGATGAACATCGACGACAAGGCGAAGAAC
LeuValGluAlaSerAspLeuValLeuArg ThrMetAsnIleAspAspLysAlaLysAsn

CTGCGCTTCCAATGGACCATCGCCAAGGCC GGCCAAGTCTCGAACATCATCCCCGCCAGC 
LeuArgPheGlnTrpThrlleAlaLysAla GlyGlnValSerAsnllelleProAlaSer 

GCCACGCTGAACGCCGACGTGCGCTACGCG CGCAACGAGGACTTCGACGCCGCCATGAAG 
AlaThrLeuAsnAlaAspValArgTyrAla ArgAsnGluAspPheAspAlaAlaMetLys 
ACGCTGGAAGAGCGCGCGCAGCAGAAGAAG CTGCCCGAGGCCGACGTGAAGGTGATCGTC
ThrLeuGluGluArgAlaGlnGlnLysLys LeuProGluAlaAspValLysValIleVal

ACGCGCGGCCGCCCGGCCTTCAATGCCGGC GAAGGCGGCAAGAAGCTGGTCGACAAGGCG 
ThrArgGlyArgProAlaPheAsnAlaGly GluGlyGlyLysLysLeuValAspLysAla

GTGGCCTACTACAAGGAAGCCGGCGGCACG CTGGGCGTGGAAGAGCGCACCGGCGGCGGC
ValAlaTyrTyrLysGluAlaGlyGlyThr LeuGlyValGluGluArgThrGlyGlyGly

ACCGACGCGGCCTACGCCGCGCTCTCAGGC AAGCCAGTGATCGAGAGCCTGGGCCTGCCG
ThrAspAlaAlaTyrAlaAlaLeuSerGly LysProValIleGluSerLeuGlyLeuPro

GGCTTCGGCTACCACAGCGACAAGGCCGAG TACGTGGACATCAGCGCGATTCCGCGCCGC
GlyPheGlyTyrHisSerAspLysAlaGlu TyrValAspIleSerAlaIleProArgArg

Eco RI

CTGTACATGGCTGCGCGCCTGATCATGGAT CTGGGCGCCGGCAAGgaattcGGAGGTGGC
LeuTyrMetAlaAlaArgLeuIleMetAsp LeuGlyAlaGlyLysGluPhe GlyGlyGly

Kpn I 415

GGAggtacCGCACCCATGGCAGAAGGAGGA GGGCAGAATCATCACGAAGTGGTGAAGTTC
GlyGlyThr AlaProMetAlaAspGlyGly GlyGlnAsnHisHisGluValValLysPhe

ATGGATGTCTATCAGCGCAGCTACTGCCAT CCAATCGAGACCCTGGTGGACATCTTCCAG
MetAspValTyrGlnArgSerTyrCysHis ProIleGluThrLeuValAspIlePheGln

GAGTACCCTGATGAGATCGAGTACATCTTC AAGCCATCCTGTGTGCCCCTGATGCGATGC 
GluTyrProAspGluIleGluTyrIlePhe LysProSerCysValProLeuMetArgCys

GGGGGCTGCTGCAATGACGAGGGCCTGGAG TGTGTGCCCACTGAGGAGTCCAACATCACC
GluGluCysCysAsnAspGluGlyLeuAsp CysValProThrGluGluSerAsnIleThr

ATGCAGATTATGCGGATCAAACCTCACCAA GGCCAGCACATAGGAGAGATGAGCTTCCTA
MetGlnIleMetArgIleLysProHisGln GlyGlnHisIleGlyGluMetSerPheLeu

Bam HI Eco RI 

G A nHis A^n ^r^rr^ 0 * 00 ^ 0 ^ G ^ga ttcgaattc CA TCA TCACCA C 
GlnHisAsnLysCysGluCysArgProLys LysAspGlySerGluPheHisHisHisHis



Xba I

CATCACGCTTCCTAGtctaga 3'
HisHisAlaSer • -C
SEQUENCE ID NOS. 19 and 20
Nco I

5' CCATGGAGAGCAAGGTGCTGCTGGCCGTC GCCCTGTGGCTCTGCGTGGAGACCCGGGCC
N- MetGluSerLysValLeuLeuAlaVal AlaLeuTrpLeuCysValGluThrArgAla
-19 

GCCTCTGTGGGTTTGCCTAGTGTTTCTCTT GATCTGCCCAGGCTCAGCATACAAAAAGAC 
AlaSerValGlyLeuProSerValSerLeu AspLeuProArgLeuSerlleGlnLysAsp 



ATACTTACAATTAAGGCTAATACAACTCTT CAAATTACTTGCAGGGGACAGAGGGACTTG
IleLeuThrlleLysAlaAsnThrThrLeu GlnlleThrCysArgGlyGlnArgAspLeu 

GACTGGCTTTGGCCCAATAATCAGAGTGGC AGTGAGCAAAGGGTGGAGGTGACTGAGTGC 
AspTrpLeuTrpProAsnAsnGlnSerGly SerGluGlnArgValGluValThrGluCys 

AGCGATGGCCTCTTCTGTAAGACACTCACA ATTCCAAAAGTGATCGGAAATGACACTGGA 
SerAspGlyLeuPheCysLysThrLeuThr IleProLysValIleGlyAsnAspThrGly

GCCTACAAGTGCTTCTACCGGGAAACTGAC TTGGCCTCGGTCATTTATGTCTATGTTCAA 
AlaTyrLysCysPheTyrArgGluThrAsp LeuAlaSerValIleTyrTyrTyrValGln

GATTACAGATCTCCATTTATTGCTTCTGTT AGTGACCAACATGGAGTCGTGTACATTACT 
AspTyrArgSerProPheIleAlaSerVal SerAspGlnHisGlyValValTyrIleThr

GAGAACAAAAACAAAACTGTGGTGATTCCA TGTCTCGGGTCCATTTCAAATCTCAACGTG 
GluAsnLysAsnLysThrValValllePro CysLeuGlySerlleSerAsnLeuAsnVal 

TCACTTTGTGCAAGATACCCAGAAAAGAGA TTTGTTCCTGATGGTAACAGAATTTCCTGG 
SerLeuCysAlaArgTyrProGluLysArg PheValProAspGlyAsnArgIleSerTrp

GACAGCAAGAAGGGCTTTACTATTCCCAGC TACATGATCAGCTATGCTGGCATGGTCTTC 
AspSerLysLysGlyPheThrlleProSer TyrMetlleSerTyrAlaGlyMetValPhe 

TGTGAAGCAAAAATTAATGATCAAAGTTAC CAGTCTATTATGTACATAGTTGTCGTTGTA 
CysGluAlaLysIleAsnAspGluSerTyr GlnSerlleMetTyrlleValValValVal 

GGGTATAGGATTTATGATGTGGTTCTGAGT CCGTCTCATGGAATTGAACTATCTGTTGGA 
GlyTyrArgIleTyrAspValValLeuSer ProSerHisGlyIleGluLeuSerValGly

GAAAAGCTTGTCTTAAATTGTACAGCAAGA ACTGAACTAAATGTCGGGATTGACTTCAAC 
GluLysLeuValLeuAsnCysThrAlaArg ThrGluLeuAsnValGlyIleAspPheAsn

TGGGAATACCCTTCTTCGAAGCATCAGCAT AAGAAACTTGTAAACCGAGACCTAAAAACC 
TrpGluTyrProSerSerLysHisGlnHis LysLysLeuValAsnArgAspLeuLysThr

CAGTCTGGGAGTGAGATGAAGAAATTTTTG AGCACCTTAACTATAGATGGTGTAACCCGG 
GlnSerGlySerGluMetLysLysPheLeu SerThrLeuThrIleAspGlyValThrArg

AGTGACCAAGGATTGTACACCTGTGCAGCA TCCAGTGGGCTGATGACCAAGAAGAACAGC 
SerAspGlnGlyLeuTyrThrCysAlaAla SerSerGlyLeuMetThrLysLysAsnSer
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ACATTTGTCAGGGTCCATGAAAAACCTTTT GTOGCTTTTGGAAGTGGCATGGAATCICTG 
ThrPheValArgValHisGluLysProPhe ValAlaPheGlySerGlyMetGluSerLeu

GTGGAAGCCACGGTGGGGGAGCGTGTCAGA ATCCCTGCGAAGTACCTTGGTTACCCACCC 
ValGluAlaThrValGlyGluArgValArg IleProAlaLysTyrLeuGlyTyrProPro

CCAGAAATAAAATGGTATAAAAATGGAATA CCCCTTGAGTCCAATCACACAATTAAAGCG 
ProGluIleLysTrpTyrLysAsnGlyIle ProLeuGluSerAsnHisThrIleLysAla

GGGCATGTACTGACGATTATGGAAGTGAGT GAAAGAGACACAGGAAATTACACTGTCATC 
GlyHisValLeuThrIleMetGluValSer GluArgAspThrGlyAsnTyrThrValIle

CTTACCAATCCCATTTCAAAGGAGAAGCAG AGCCATGTGGTCTCTCTGGTTGTGTATGTC 
LeuThrAsnProIleSerLysGluLysGln SerHisValValSerLeuValValTyrVal 

CCACCCCAGATTGGTGAGAAATCTCTAATC TCTCCTGTGGATTCCTACCAGTACGGCACC
ProProGlnlleGlyGluLysSerLeuIle SerProValAspSerTyrGlnTyrGlyThr 

ACTCAAACGCTGACATGTACGGTCTATGCC ATTCCTCCCCCGCATCACATCCACTGGTAT 
ThrGlnThrLeuThrCysThrValTyrAla IleProProProHisHisIleHisTrpTyr

TGGCAGTTGGAGGAAGAGTGCGCCAACGAG CCCAGCCAAGCTGTCTCAGTGACAAACCCA 
TrpGlnLeuGluGluGluCysAlaAsnGlu ProSerGlnAlaValSerValThrAsnPro 

TACCCTTGTGAAGAATGGAGAAGTGTGGAG GACTTCCAGGGAGGAAATAAAATTGAAGTT 
TyrProCysGluGluTrpArgSerValGlu AspPheGlnGlyGlyAsnLysIleGluVal 

AATAAAAATCAATTTGCTCTAATTGAAGGA AAAAACAAAACTGTAAGTACCCTTGTTATC 
AsnLysAsnGlnPheAlaLeuIleGluGly LysAsnLysThrValSerThrLeuValIle

CAAGCGGCAAATGTGTCAGCTTTGTACAAA TGTGAAGCGGTCAACAAAGTCGGGAGAGGA
GlnAlaAlaAsnValSerAlaLeuTyrLys CysGluAlaValAsnLysValGlyArgGly 

GAGAGGGTGATCTCCTTCCACGTGACCAGG GGTCCTGAAATTACTTTGCAACCTGACATG 
GluArgVallleSerPheHisValThrArg GlyProGluIleThrLeuGlnProAspMet 

CAGCCCACTGAGCAGGAGAGCGTGTCTTTG TGGTGCACTGCAGACAGATCTACCTTTGAG
GlnProThrGluGlnGluSerValSerLeu TrpCysThrAlaAspArgSerThrPheGlu 

AACCTCACATGGTACAAGCTTGGCCCACAG CCTCTGCCAATCCATGTGGGAGAGTTGCCC 
AsnLeuThrTrpTyrLysLeuGlyProGln ProLeuProIleHisValGlyGluLeuPro 

ACACCTGTTTGCAAGAACTTGGATACTCTT TGGAAATTGAATGCCACCATGTTCTCTAAT 
ThrProValCysLysAsnLeuAspThrLeu TrpLysLeuAsnAlaThrMetPheSerAsn 

AGCACAAATGACATTTTGATCATGGAGCTT AAGAATGCATCCTTGCAGGACCAAGGAGAC 
SerThrAsnAspIleLeuIleMetGluLeu LysAsnAlaSerLeuGlnAspGlnGlyAsp 

WTGTCTGCCTTGCTCAAGACAGGAAGACC AAGAAAAGACATTGCGTGGTCAGGCAGCTC 
TyrValCysLeuAlaGlnAspArgLysThr LysLysArgRisCysValValArgGlnLeu 

ACAGTCCTAGAGCGTGTGGCACCCACGATC ACAGGAAACCTGGAGAATCAGACGACAAGT 
ThrValLeuGluArgValAlaProThrlle ThrGlyAsnLeuGluAsnGlnThrThrSer 
ATTGGGGAAAGCATCGAAGTCTCATGCACG GCATCTGGGAATCCCCCTCCACAGATCATG
IleGlyGluSerIleGluValSerCysThr AlaSerGlyAsnProProProGlnIleMet

TGGTTTAAAGATAATGAGACCCTTGTAGAA GACTCAGGCATTGTATTGAAGGATGGGAAC 
TrpPheLysAspAsnGluThrLeuValGlu AspSerGlyIleValLeuLysAspGlyAsn

CGGAACCTCACTATCCGCAGAGTGAGGAAG GAGGACGAAGGCCTCTACACCTGCCAGGCA 
ArgAsnLeuThrIleArgArgValArgLys GluAspGluGlyLeuTyrThrCysGlnAla

TGCAGTGTTCTTGGCTGTGCAAAAGTGGAG GCATTTTTCATAATAGAAGGTGCCCAGGAA 
CysSerValLeuGlyCysAlaLysValGlu AlaPhePheIleIleGluGlyAlaGlnGlu

AAGACGAACTTGGAAATCATTATTCTAGTA GGCACGGCGGTGATTGCCATGTTCTTCTGG
LysThrAsnLeuGluIleIleIleLeuVal GlyThrAlaValIleAlaMetPhePheTrp
745 

CTACTTCTTGTCATCATCCTACGGACCGTT AAGCGGGCCAATGGAGGGGAACTGAAGACA 
LeuLeuLeuValIleIleLeuArgThrVal LysArgAlaAsnGlyGlyGluLeuLysThr

770 

GGCTACTTGTCCATCGTCATGGATCCAGAT GAACTCCCATTGGATGAACATTGTGAACGA 
GlyTyrLeuSerIleValMetAspProAsp GluLeuProLeuAspGluHisCysGluArg

Eco RI HindIII

CTGCCTTATGATGCCAGCAAATGGGAATTC aagcttcccggggtcgacatcgatgagcag 
LeuProTyrAspAlaSerLysTrpGluPhe LysLeuProGlyValAspIleAspGluGln

810 

Xba I 

aagctgatatccgaggaggacctgaactagtctaga 3'
LysLeuIleSerGluGluAspLeuAsn • -C 
CLAIMS

1. A two component system for use in association with one 
another comprising: 

(a) a fusion protein or conjugate of a ligand with an
enzyme; and

(b) a prodrug which can be converted into an active 
drug by said enzyme. 

2. A system according to claim 1 wherein the enzyme is a
bacterial carboxypeptidase.

3. A system according to claim 2 wherein the
carboxypeptidase is carboxypeptidase CPG2.

4. A system according to claim 3 wherein the CPG2 has been
altered at one or more glycosylation sites to prevent
glycosylation at said site or sites.

5. A system according to any one of claims 2 to 4 wherein
the prodrug is a nitrogen mustard prodrug.

6. A system according to any one of claims 1 to 5 wherein
the ligand is vascular endothelial growth factor (VEGF).

7. A fusion protein comprising a ligand and an enzyme 
capable of converting a prodrug into a toxic agent. 

8. A fusion protein substantially as described in SEQ ID
No. 8, 10, 12, 14, 16 or 18 or as encoded by the DNA of
SEQ ID No. 7, 9, 11, 13, 15 or 17.

9. A system according to claim 1 comprising a fusion
protein according to claim 8 and a prodrug which can be
converted into an active drug by CPG2.
10. A system according to any one of claims 1 to 6 or 9 for
use in a method of treatment or therapy of the human or
animal body.

11. Use of a fusion protein or conjugate of a ligand with an
enzyme capable of binding to a tumour and a cell and a
prodrug capable of being converted by said enzyme to an
active drug, in the manufacture of a medicament for use
in the treatment of the tumour.

12. A process for preparing a two component system for use
in association with one another comprising:

(a) a fusion protein or conjugate of a ligand with an
enzyme; and

(b) a prodrug which can be converted into an active
drug by said enzyme;

said process comprising preparing a fusion protein or
conjugate of a ligand with an enzyme,

and placing said fusion protein or conjugate in
association with a prodrug which can be converted into
an active drug by said enzyme.

13. A process for preparing a fusion protein comprising a
ligand and an enzyme capable of converting a prodrug
into a toxic agent, said process comprising culturing a
host cell which contains an expression vector which
comprises a nucleic acid encoding said fusion protein, said
culturing being carried out under conditions in which
said fusion protein is expressed, and recovering the
fusion protein in substantially isolated form.

14. A nucleic acid encoding a fusion protein as claimed in
claim 7 or 8.

15. A vector comprising a nucleic acid as claimed in claim
14.



WO 97/26918 PCT/GB97/00221

16. A vector as claimed in claim 15 comprising a bacterial 
carboxypeptidase gene which has been altered by 
substitution, deletion or insertion at one or more 
glycosylation sites, fused in-frame to a gene encoding
a ligand or fragment or derivative thereof. 

17. A host cell which contains an expression vector 
comprising a vector as claimed in claim 15 or 16 
operably linked to a promoter compatible with the host 
cell for expression of the fusion protein. 

18. A product containing:

(a) a fusion protein or conjugate of a ligand with
an enzyme; and

(b) a prodrug which can be converted into an 
active drug by said enzyme; 

as a combined preparation for separate or sequential use 
in the treatment of a tumour. 

19. A method of treating a tumour comprising administering 
to a subject in need thereof an effective amount of a 
fusion protein or conjugate of a ligand with an enzyme 
capable of binding to a tumour and a cell and a prodrug 
capable of being converted by said enzyme to an active 
drug, in the manufacture of a medicament for use in the
treatment of the tumour. 
Fig. 3.
Fig. 7.
Fig. 8.
Fig. 9A.